Methods and materials for treatment of pompe&#39;s disease

ABSTRACT

This document relates to molecular complexes having acid alpha glucosidase activity and at least one modification that results in enhanced ability of the molecular complex to be transported to the interior of a mammalian cell.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application Ser. No. 61/611,485, filed Mar. 15, 2012. The disclosure of the prior application is considered part of (and is incorporated by reference in) the disclosure of this application.

TECHNICAL FIELD

This invention relates to isolated molecular complexes having acid alpha glucosidase activity, and more particularly to molecular complexes comprising at least two polypeptides derived by proteolysis from a precursor molecule, wherein the molecular complex includes at least one modification that results in enhanced ability of the molecular complex to be transported to the interior of a mammalian cell.

BACKGROUND

Pompe's disease (also referred to as glycogen-storage disease type II or acid-maltase deficiency) is a rare autosomal recessive disorder that results in an accumulation of glycogen in the lysosome due to a deficiency of acid alpha glucosidase (GAA). The build-up of glycogen causes progressive muscle weakness (myopathy) throughout the body and affects various body tissues, including the heart, skeletal muscles, live, and nervous system.

Pompe's disease is broadly classified into infantile and late onset forms. In the infantile-onset form, infants typically present during early infancy (4-8 months of age) with weakness and floppiness, and are unable to hold up their heads and cannot do other motor tasks common for their age, such as rolling over. Without treatment, infants with Pompe's disease usually die before 12 months of age due to heart failure and respiratory weakness. See, United Pompe Foundation. Late onset forms (including juvenile and adult forms), have a later onset and progress more slowly than the infantile form. Recombinant human GAA (Myozyme® or Lumizyme®) is used to treat Pompe's disease. However, Myozyme® or Lumizyme® are both very expensive, with costs well over $300,000 per year. As such, there is a need for improved treatments for Pompe's disease.

SUMMARY

In one aspect, this document features an isolated molecular complex having acid alpha glucosidase (GAA) activity and that includes at least two polypeptides (e.g., at least three or at least four polypeptides), each polypeptide having at least 85% (e.g., at least 90%, 95%, 99%, or 100%) sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65). The molecular complex includes at least one modification that results in enhanced ability of the molecular complex to be transported to the interior of a mammalian cell. Proteolysis of the amino acid sequence set forth in SEQ ID NO:1 further can include cleavage at one or more sites between amino acid 719 and amino acid 746 or cleavage at one or more sites between amino acid 137 and amino acid 151 of the amino acid sequence set forth in SEQ ID NO:1. Proteolysis further can include cleavage at one or more sites between amino acid 719 and amino acid 746 of the amino acid sequence set forth in SEQ ID NO:1 and cleavage at one or more sites between amino acid 137 and amino acid 151 of the amino acid sequence set forth in SEQ ID NO:1.

In any of the molecular complexes described herein, at least one of the polypeptides can include one or more phosphorylated N-glycans and the modification can include uncapping and demannosylation of at least one phosphorylated N-glycan. At least 40% (e.g., at least 60%, 80%, 90%, 95%, or 99%) of the N-glycans on at least one of the polypeptides can be uncapped and demannosylated.

In any of the molecular complexes described herein, for one of the at least two polypeptides, the segment includes amino acids 22 to 57 of SEQ ID NO:1, and wherein for one of the at least two polypeptides, the segment includes amino acids 66 to 896 of SEQ ID NO:1.

In any of the molecular complexes described herein containing at least three polypeptides, for one of the at least three polypeptides, the segment includes amino acids 22 to 57 of SEQ ID NO:1, wherein for one of the at least three polypeptides, the segment includes amino acids 66 to 726 of SEQ ID NO:1, and wherein for one of the at least three polypeptides, the segment includes amino acids 736 to 896 of SEQ ID NO:1.

In any of the molecular complexes described herein containing at least four polypeptides, for one of the at least four polypeptides, the segment includes amino acids 22 to 57 of SEQ ID NO:1, wherein for one of the at least four polypeptides, the segment includes amino acids 66 to 143 of SEQ ID NO:1, wherein for one of the at least four polypeptides, the segment includes amino acids 158 to 726 of SEQ ID NO:1, and wherein for one of the at least four polypeptides, the segment includes amino acids 736 to 896 of SEQ ID NO:1.

In any of the molecular complexes described herein, the at least one modification can include any one of the following fused to at least one polypeptide in the molecular complex: a ligand for an extracellular receptor, a targeting domain that binds an extracellular domain of a receptor on the surface of a target cell, a urokinase-type plasminogen receptor, or the recognition domain of human insulin-like growth factor II.

This document also features compositions that include any of the molecular complexes described herein, wherein the molecular complex is lyophilized. The composition can be packaged as a single use vial.

This document also features a pharmaceutical composition that includes any of the molecular complexes described herein and a pharmaceutically acceptable carrier. The composition can be formulated for intravenous or subcutaneous administration. The composition can be formulated for intravenous infusion.

In another aspect, this document features a method of treating Pompe's disease. The method includes administering any of the compositions described herein to a patient diagnosed with Pompe's disease. The patient can be diagnosed with infantile onset Pompe's disease or late onset Pompe's disease.

This document also features a method for making a molecular complex. The method includes contacting a polypeptide having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:1 with a protease having at least 85% (e.g., at least 90%, at least 95%, at least 99%, or 100%) sequence identity to the amino acid sequence set forth in SEQ ID NO:8, wherein the protease cleaves the polypeptide at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65). The contacting step can be performed in vitro.

This document also features a method for making a molecular complex that includes uncapped and demannosylated phosphorylated N-glycans. The method includes contacting a molecular complex with a mannosidase capable of (i) hydrolyzing a mannose-1-phospho-6-mannose moiety to mannose-6-phosphate and (ii) hydrolyzing terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkages, the molecular complex having GAA activity and including at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65), wherein before the contacting, at least one of the polypeptides includes phosphorylated N-glycans containing one or more mannose-1-phospho-6-mannose moieties. The mannosidase can be a family 38 glycosyl hydrolase (e.g., a Canavalia ensiformis mannosidase or a Yarrowia lipolytica mannosidase). The contacting can occur in a recombinant fungal cell expressing the mannosidase.

This document also features a method of making a molecular complex that includes uncapped and demannosylated phosphorylated N-glycans. The method includes contacting a molecular complex with a mannosidase capable of hydrolyzing terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkages, the molecular complex having GAA activity and comprising at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65), wherein at least one of the polypeptides includes prior to contacting, phosphorylated N-glycans comprising uncapped mannose-6-phosphate moieties. The mannosidase can be a family 47 glycosyl hydrolase (e.g., an Aspergillus satoi mannosidase), a family 92 glycosyl hydrolase (e.g., a Cellulosimicrobium cellulans mannosidase), or a family 38 glycosyl hydrolase (e.g., a Canavalia ensiformis mannosidase). The contacting can occur in a recombinant fungal cell expressing the mannosidase.

This document also features a method of making a molecular complex that includes uncapped and demannosylated phosphorylated N-glycans. The method includes contacting a molecular complex with a mannosidase capable of hydrolyzing a mannose-1-phospho-6-mannose moiety to mannose-6-phosphate, the molecular complex having GAA activity and including at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65), wherein at least one of the polypeptides includes, before the contacting, one or more mannose-1-phospho-6-mannose moieties. The mannosidase can be a family 38 glycosyl hydrolase (e.g., a Canavalia ensiformis mannosidase or a Yarrowia lipolytica mannosidase).

In another aspect, this document features a method of making a molecular complex that includes uncapped and demannosylated phosphorylated N-glycans. The method includes a) contacting a molecular complex with a mannosidase capable of hydrolyzing a mannose-1-phospho-6-mannose moiety to mannose-6-phosphate to uncap mannose-6-phosphate moieties on at least one polypeptide in the molecular complex, the molecular complex having GAA activity and comprising at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65); and b) contacting the molecular complex with a mannosidase capable of hydrolyzing terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkages. Step (a) and step (b) can be catalyzed by two different enzymes or catalyzed by a single enzyme. The contacting steps can be performed together or separately, and in either order. The contacting can occur in a recombinant fungal host cell, the fungal host cell expressing a mannosidase capable of catalyzing step (a) and a mannosidase capable of catalyzing step (b). The contacting can occur in a recombinant fungal host cell, the fungal host expressing a mannosidase capable of catalyzing steps (a) and (b).

Any of the molecular complexes described herein that include at least one uncapped and demannosylated N-glycan can be used to contact a mammalian cell, wherein, after the contacting, the molecular complex is transported to the interior of the mammalian cell with enhanced efficiency. The mammalian cell can be a human cell.

This document also features a method of transporting a molecular complex having GAA activity to the interior of a cell. The method includes contacting a mammalian cell with the molecular complex, the molecular complex including at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65); wherein phosphorylated N-glycans on at least one of the polypeptides have been uncapped and demannosylated as set forth in the methods described herein. The mammalian cell can be in vitro or in a mammalian subject. The mammalian cell can be a human cell.

In another aspect, this document features a method of transporting a molecular complex having GAA activity to the interior of a cell. The method includes contacting a mammalian cell with the molecular complex that includes at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65), the molecular complex comprising at least one modification that results in enhanced ability of the molecular complex to be transported to the interior of a mammalian cell. The mammalian cell can be in vitro or in a mammalian subject. The mammalian cell can be a human cell. The modification can include any one of the following fused to at least one polypeptide in the molecular complex: a ligand for an extracellular receptor, a targeting domain that binds an extracellular domain of a receptor on the surface of a target cell, a urokinase-type plasminogen receptor, or the recognition domain of human insulin-like growth factor II.

In another aspect, this document features an isolated fungal cell that includes an exogenous nucleic acid encoding an alkaline protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8.

This document also features an isolated fungal cell comprising a nucleic acid encoding the GAA amino acid sequence set forth in SEQ ID NO:1 and a nucleic acid encoding an alkaline protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8. The fungal cell produces a molecular complex having GAA activity and comprising at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65) by the alkaline protease. In some embodiments, the fungal cell further comprises a nucleic acid encoding a mannosidase, the mannosidase being capable of hydrolyzing a mannose-1-phospho-6-mannose moiety to mannose-6-phosphate. In some embodiments, the fungal cell further includes a nucleic acid encoding a mannosidase, the mannosidase being capable of hydrolyzing a terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkage. In some embodiments, the fungal cell further can include a nucleic acid encoding a mannosidase, the mannosidase being capable of (i) hydrolyzing a mannose-1-phospho-6-mannose moiety to mannose-6-phosphate and (ii) hydrolyzing a terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkage. Any of such fungal cells further can include a nucleic acid encoding a polypeptide capable of promoting mannosyl phosphorylation and/or be genetically engineered to be deficient in OCH1 activity.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the exemplary methods and materials are described below. All publications, patent applications, patents, Genbank® Accession Nos, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present application, including definitions, will control. The materials, methods, and examples are illustrative only and not intended to be limiting.

Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.

DESCRIPTION OF DRAWINGS

FIG. 1 is a depiction of the amino acid sequence (SEQ ID NO:1) of human acid alpha glucosidase (GAA) after cleavage of the signal sequence.

FIG. 2A is a depiction of the nucleotide sequence of the open reading frame (ORF) of DsbA-Cellulosimicrobium cellulans mannosidase 5 (CcMan5) (SEQ ID NO:2).

FIG. 2B is a depiction of the amino acid sequence of the CcMan5 polypeptide with the signal sequence in bold (SEQ ID NO: 3).

FIG. 2C is a depiction of the amino acid sequence of the CcMan5 polypeptide without signal sequence (SEQ ID NO:4). The predicted molecular weight of the CcMan5 polypeptide without the signal sequence is 173 kDa.

FIGS. 3A and 3B are a series of electropherograms depicting the N-glycan analysis of rhGAA treated with CcMan5 and JbMan. Analysis was performed using DNA sequencer-assisted, fluorophore-assisted carbohydrate electrophoresis (DSA-FACE). The Y-axis represents the relative fluorescence units as an indication of the amount of each N-glycan structure. The X-axis represents the relative mobility of each N-glycan structure through a capillary. In both FIG. 3A and FIG. 3B, panel A is a reference sample containing the N-glycans released from RNaseB with PNGaseF. In FIG. 3A, panels B and C contain the N-glycan analysis from huGAA (76 kD variant) before and after treatment, respectively, with CcMan5 and JbMan. In FIG. 3B, panels B, C, and D contain the N-glycan analysis from huGAA 76 kD form, 95 kD form, and 110 kD form, respectively.

FIG. 4 is a line graph of the amount of glucose formed per minute with Myozyme (•), 76 kDa GAA (▴), 95 kDa GAA (▾), and 110 kDa GAA (♦) using rabbit liver glycogen as substrate.

FIG. 5A contains two depictions of the glycogen levels (μg/mg protein) of individual mice in heart. FIG. 5B contains two depictions of the glycogen levels (μg/mg protein) of individual mice in skeletal muscle. Unfilled dots are females, black-filled dots are males. Line represents the median of each group.

FIG. 6 contains a depiction of the amino acid sequence of the Yarrowia lipolytica AMS1 mannosidase (SEQ ID NO: 5).

FIG. 7 contains a depiction of the amino acid sequence of the Aspergillus satoi mannosidase (SEQ ID NO:6).

FIG. 8 contains a depiction of the amino acid sequence of the Cellulosimicrobium cellulans mannosidase 4 (CcMan4, SEQ ID NO:7), with signal sequence in bold. The predicted molecular weight of the CcMan4 polypeptide without the signal sequence is 184 kDa.

FIG. 9 contains a depiction of the amino acid sequence of the Aspergillus oryzae alkaline protease including the signal peptide (21 amino acids), pro-peptide (100 amino acids) and mature protein (282 amino acids) (SEQ ID NO:9).

FIG. 10 contains a depiction of the nucleotide sequence of the fusion construct containing the Y. lipolytica codon optimized sequence encoding the A. oryzae alkaline protease (SEQ ID NO:10). Restriction sites used for cloning are underlined. The nucleotide sequence encoding the linker is in bold and the nucleotide sequence encoding the His tag (10 His residues) is italicized.

DETAILED DESCRIPTION

In general, this document provides isolated molecular complexes having acid alpha-glucosidase (GAA) activity and at least one modification that results in an enhanced ability to be transported to the interior of a mammalian cell. GAA is synthesized as a 110 kDa precursor containing N-linked glycans. The precursor is proteolytically processed to remove the signal sequence and then further proteolytically processed to major species of 95 kDa, 76 kDa, and 70 kDa. However, at least some of the peptides that are released from the precursor remain associated with the major species. See, for example, Moreland et al., J. Biol. Chem., 280:6780-6791 (2005). Thus, the molecular complexes having GAA activity described herein include at least two polypeptides (at least two, three, or four polypeptides) that are derived from proteolytic cleavage of the precursor molecule at one or more sites. At least two polypeptides in the molecular complex result from proteolytic cleavage at one or more sites in the precursor. For example, proteolysis of the amino acid sequence set forth in SEQ ID NO:1 can be between amino acid 50 and amino acid 74, e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65, to produce at least two polypeptides. A molecular complex containing two polypeptides is referred to as the 95 kDa form herein.

In some embodiments, at least three polypeptides in the molecular complex result from proteolytic cleavage at two or more sites in the precursor. For example, proteolysis of the amino acid sequence set forth in SEQ ID NO:1 can include, in addition to cleavage between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65), cleavage at one or more sites between amino acid 719 and amino acid 746 or cleavage at one or more sites between amino acid 137 and amino acid 151 of the amino acid sequence set forth in SEQ ID NO:1. A molecular complex containing three polypeptides is referred to as the 76 kDa form herein.

In some embodiments, at least four polypeptides in the molecular complex result from proteolytic cleavage at three or more sites in the precursor. For example, proteolysis of the amino acid sequence set forth in SEQ ID NO:1 can include, in addition to the cleavage between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65), cleavage at one or more sites between amino acid 719 and amino acid 746 of the amino acid sequence set forth in SEQ ID NO:1 and cleavage at one or more sites between amino acid 137 and amino acid 151 of the amino acid sequence set forth in SEQ ID NO:1. A molecular complex containing four polypeptides is referred to as the 70 kDa form herein.

It will be appreciated that cleavage can occur at one or more sites in one molecule, and that the site of cleavage can be different in different molecules.

A commercially available protease mix containing proteases from Aspergillus oryzae (e.g., from Sigma or NovozymesCorp) can be used to cleave the amino acid sequence set forth in SEQ ID NO:1 between amino acids 50 and 74, e.g., between amino acids 56 and 68 or between amino acids 60 and 65. Alternatively, an alkaline protease having at least 85% (e.g., at least 90%, 95%, 97%, 98%, 99%, or 100%) sequence identity to the alkaline protease from Aspergillus oryzae (SEQ ID NO:8) can be used. For example, as described herein, a GAA polypeptide having the amino acid sequence set forth in SEQ ID NO:1 can be contacted with a protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8 or SEQ ID NO: 9. SEQ ID NO: 8 is the amino acid sequence of the mature Aspergillus oryzae alkaline protease. SEQ ID NO: 9 is the amino acid sequence of the Aspergillus oryzae protease including the signal peptide, pro-peptide, and mature protein. The contacting can occur in vitro using protease that has been isolated from Aspergillus oryzae or that has been recombinantly produced. Alternatively, a fungal host can be engineered such that the GAA polypeptide and alkaline protease are both secreted into the culture medium, where the alkaline protease can cleave the amino acid sequence set forth in SEQ ID NO:1 between amino acid 50 and amino acid 74 (e.g., between amino acids 56 and 68 or between amino acids 60 and 65).

The isolated molecular complexes described herein have at least one modification that results in an enhanced ability to be transported to the interior of a mammalian cell. Non-limiting examples of modifications that enhance the ability of the complex of being transported to the interior of a mammalian cell include uncapping and demannosylation of phosphorylated N-glycans or peptide tags that facilitate transport. Methods and materials are described herein for preparing molecular complexes containing tags or uncapped and demannosylated N-glycans.

The isolated molecular complexes described herein are particularly useful for treating patients with Pompe disease, including a patient diagnosed with Pompe's disease, both infantile onset Pompe's disease and late onset Pompe's disease. Pompe's disease results in an accumulation of glycogen in the lysosome due to a deficiency of GAA. The build-up of glycogen causes progressive muscle weakness (myopathy) throughout the body and affects various body tissues, including the heart, skeletal muscles, live, and nervous system.

Each of the polypeptide in the molecular complex have at least 85% sequence identity (e.g., at least 90%, 95%, 97%, 98%, 99%, or 100%) to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65). The percent identity between a particular amino acid sequence and the amino acid sequence set forth in SEQ ID NO: 1 can be determined as follows. First, the amino acid sequences are aligned using the BLAST 2 Sequences (B12seq) program from the stand-alone version of BLASTZ containing BLASTP version 2.0.14. This stand-alone version of BLASTZ can be obtained from Fish & Richardson's web site (e.g., www.fr.com/blast/) or the U.S. government's National Center for Biotechnology Information web site (www.ncbi.nlm.nih.gov). Instructions explaining how to use the Bl2seq program can be found in the readme file accompanying BLASTZ. Bl2seq performs a comparison between two amino acid sequences using the BLASTP algorithm. To compare two amino acid sequences, the options of Bl2seq are set as follows: —i is set to a file containing the first amino acid sequence to be compared (e.g., C:\seq1.txt); —j is set to a file containing the second amino acid sequence to be compared (e.g., C:\seq2.txt); —p is set to blastp; —o is set to any desired file name (e.g., C:\output.txt); and all other options are left at their default setting. For example, the following command can be used to generate an output file containing a comparison between two amino acid sequences: C:\B12seq—i c:\seq1.txt—j c:\seq2.txt—p blastp—o c:\output.txt. If the two compared sequences share homology, then the designated output file will present those regions of homology as aligned sequences. If the two compared sequences do not share homology, then the designated output file will not present aligned sequences. Similar procedures can be following for nucleic acid sequences except that blastn is used.

Once aligned, the number of matches is determined by counting the number of positions where an identical amino acid residue is presented in both sequences. The percent identity is determined by dividing the number of matches by the length of the full-length polypeptide amino acid sequence followed by multiplying the resulting value by 100.

It is noted that the percent identity value is rounded to the nearest tenth. For example, 78.11, 78.12, 78.13, and 78.14 is rounded down to 78.1, while 78.15, 78.16, 78.17, 78.18, and 78.19 is rounded up to 78.2. It also is noted that the length value will always be an integer.

It will be appreciated that a number of nucleic acids can encode a polypeptide having a particular amino acid sequence. The degeneracy of the genetic code is well known to the art; i.e., for many amino acids, there is more than one nucleotide triplet that serves as the codon for the amino acid. For example, codons in the coding sequence for a given GAA polypeptide can be modified such that optimal expression in a particular species (e.g., bacteria or fungus) is obtained, using appropriate codon bias tables for that species.

In one embodiment, a molecular complex can include at least two polypeptides, where one of the polypeptides includes amino acids 22 to 57 of SEQ ID NO:1, and another polypeptide includes amino acids 66 to 896 of SEQ ID NO:1.

In one embodiment, a molecular complex can include at least three polypeptides, wherein one of the polypeptides includes amino acids 22 to 57 of SEQ ID NO:1, one polypeptide includes amino acids 66 to 726 of SEQ ID NO:1, and one polypeptide includes amino acids 736 to 896 of SEQ ID NO:1.

In one embodiment, a molecular complex can include at least four polypeptides, wherein one of the polypeptides includes amino acids 22 to 57 of SEQ ID NO:1, one polypeptide includes amino acids 66 to 143 of SEQ ID NO:1, one polypeptide includes amino acids 158 to 726 of SEQ ID NO:1, and one polypeptide includes amino acids 736 to 896 of SEQ ID NO:1.

Biologically active variants of GAA can contain additions, deletions, or substitutions relative to the sequences set forth in SEQ ID NO: 1. GAA proteins with substitutions will generally have not more than 10 (e.g., not more than one, two, three, four, five, six, seven, eight, nine, or ten) conservative amino acid substitutions. A conservative substitution is the substitution of one amino acid for another with similar characteristics. Conservative substitutions include substitutions within the following groups: valine, alanine and glycine; leucine, valine, and isoleucine; aspartic acid and glutamic acid; asparagine and glutamine; serine, cysteine, and threonine; lysine and arginine; and phenylalanine and tyrosine. The non-polar hydrophobic amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine. The positively charged (basic) amino acids include arginine, lysine and histidine. The negatively charged (acidic) amino acids include aspartic acid and glutamic acid. Any substitution of one member of the above-mentioned polar, basic or acidic groups by another member of the same group can be deemed a conservative substitution. By contrast, a non-conservative substitution is a substitution of one amino acid for another with dissimilar characteristics.

In some embodiments, a GAA polypeptide can be a fusion protein with a heterologous amino acid sequence such as a sequence used for purification of the recombinant protein (e.g., FLAG, polyhistidine (e.g., hexahistidine), hemagluttanin (HA), glutathione-S-transferase (GST), or maltose-binding protein (MBP)).

In some embodiments, the heterologous amino acid sequence is used to enhance the efficiency of transport of the molecular complex into a mammalian cell. For example, at least one of the polypeptides in a complex can be fused to a ligand for an extracellular receptor, a targeting domain that binds an extracellular domain of a receptor on the surface of a target cell, a urokinase-type plasminogen receptor, or domains of human insulin-like growth factor II that bind to the mannose-6-phosphate receptor (e.g., amino acids 1-67 or 1-87; at least amino acids 48-55; at least amino acids 8-28 and 41-61; or at least amino acids 8-87 of human insulin-like growth factor; a sequence variant thereof of human insulin-like growth factor II (e.g., R68A) or truncated form of human insulin-like growth factor (e.g., C-terminally truncated from position 62)). The heterologous amino acid sequence can be fused at the N-terminus or C-terminus of the polypeptide. In one embodiment, a peptide tag is fused to the N- or C-terminus of the polypeptide by a spacer (e.g., 5-30 amino acids or 10-25 amino acids). See, for example, U.S. Pat. No. 7,785,856.

Heterologous amino sequences also can be proteins useful as diagnostic or detectable markers, for example, luciferase, green fluorescent protein (GFP), or chloramphenicol acetyl transferase (CAT).

In certain host cells (e.g., yeast host cells), expression and/or secretion of the target protein can be increased through use of a heterologous signal sequence. In some embodiments, the fusion protein can contain a carrier (e.g., KLH) useful, e.g., in eliciting an immune response for antibody generation) or endoplasmic reticulum or Golgi apparatus retention signals. Heterologous sequences can be of varying length and in some cases can be a longer sequences than the full-length target proteins to which the heterologous sequences are attached.

Methods of Demannosylating, or Uncapping and Demannosylating Glycoproteins

Glycoproteins containing a phosphorylated N-glycan can be demannosylated, and glycoproteins containing a phosphorylated N-glycan containing a mannose-1-phospho-6-mannose linkage or moiety can be uncapped and demannosylated by contacting the glycoprotein with a mannosidase capable of (i) hydrolyzing a mannose-1-phospho-6-mannose linkage or moiety to mannose-6-phosphate and (ii) hydrolyzing a terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkage or moiety. Non-limiting examples of such mannosidases include a Canavalia ensiformis (Jack bean) mannosidase and a Yarrowia lipolytica mannosidase (e.g., AMS1). Both the Jack bean and AMS1 mannosidase are family 38 glycoside hydrolases.

The Jack bean mannosidase is commercially available, for example, from Sigma-Aldrich (St. Louis, Mo.) as an ammonium sulfate suspension (Catalog No. M7257) and a proteomics grade preparation (Catalog No. M5573). Such commercial preparations can be further purified, for example, by gel filtration chromatography to remove contaminants such as phosphatases. The Jack bean mannosidase contains a segment with the following amino acid sequence NKIPRAGWQIDPFGHSAVQG (SEQ ID NO: 11). See Howard et al., J. Biol. Chem., 273(4):2067-2072, 1998.

The Yarrowia lipolytica AMS1 mannosidase can be recombinantly produced. The amino acid sequence of the AMS1 polypeptide is set forth in SEQ ID NO:5 (see also FIG. 6).

In some embodiments, the uncapping and demannosylating steps are catalyzed by two different enzymes. For example, uncapping of a mannose-1-phospho-6 mannose linkage or moiety can be performed using a mannosidase from Cellulosimicrobium cellulans (e.g., CcMan5). The nucleotide sequence encoding the CcMan5 polypeptide is set forth in SEQ ID NO:2 (see FIG. 2A). The amino acid sequence of the CcMan5 polypeptide containing signal sequence is set forth in SEQ ID NO: 3 (see FIG. 2B). The amino acid sequence of the CcMan5 polypeptide without signal sequence is set forth in SEQ ID NO:4 (see FIG. 2C). In some embodiments, a biologically active fragment of the CcMan5 polypeptide is used. For example, a biologically active fragment can includes residues 1-774 of the amino acid sequence set forth in SEQ ID NO:4. See also WO 2011/039634. The CcMan5 mannosidase is a family 92 glycoside hydrolase.

Demannosylation of an uncapped glycoprotein or molecular complexes of glycoproteins can be catalyzed using a mannosidase from Aspergillus satoi (As) (also known as Aspergillus phoenicis) or a mannosidase from Cellulosimicrobium cellulans (e.g., CcMan4). The Aspergillus satoi mannosidase is a family 47 glycoside hydrolase and the CcMan4 mannosidase is a family 92 glycoside hydrolase. The amino acid sequence of the Aspergillus satoi mannosidase is set forth in FIG. 7 (SEQ ID NO:6) and in GenBank Accession No. BAA08634. The amino acid sequence of the CcMan4 polypeptide is set forth in FIG. 8 (SEQ ID NO:7).

Demannosylation of an uncapped glycoprotein or molecular complexes of glycoproteins also can be catalyzed using a mannosidase from the family 38 glycoside hydrolases such as a Canavalia ensiformis (Jack bean) mannosidase or a Yarrowia lipolytica mannosidase (e.g., AMS1). For example, CcMan5 can be used to uncap a mannose-1-phospho-6 mannose moiety on a glycoprotein (or molecular complex of glycoproteins) and the Jack bean mannosidase can be used to demannosylate the uncapped glycoprotein (or molecular complex of glycoproteins).

To produce demannosylated glycoproteins (or molecular complexes of glycoproteins), or uncapped and demannosylated glycoproteins (or molecular complexes of glycoproteins), a target molecule (or molecular complex) containing a mannose-1-phospho-6 mannose linkage or moiety is contacted under suitable conditions with a suitable mannosidase(s) and/or a cell lysate containing a suitable recombinantly produced mannosidase(s). Suitable mannosidases are described above. The cell lysate can be from any genetically engineered cell, including a fungal cell, a plant cell, or animal cell. Non-limiting examples of animal cells include nematode, insect, plant, bird, reptile, and mammals such as a mouse, rat, rabbit, hamster, gerbil, dog, cat, goat, pig, cow, horse, whale, monkey, or human.

Upon contacting the target molecule (e.g., molecular complex) with the purified mannosidases and/or cell lysate, the mannose-1-phospho-6-mannose linkage or moiety can be hydrolyzed to phospho-6-mannose and the terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkage or moiety of such a phosphate containing glycan can be hydrolyzed to produces an uncapped and demannosylated target molecule. In some embodiments, one mannosidase is used that catalyzes both the uncapping and demannosylating steps. In some embodiments, one mannosidase is used to catalyze the uncapping step and a different mannosidase is used to catalyze the demannosylating step. Following processing by the mannosidase, the target molecule or molecular complex can be isolated.

Suitable methods for obtaining cell lysates that preserve the activity or integrity of the mannosidase activity in the lysate can include the use of appropriate buffers and/or inhibitors, including nuclease, protease and phosphatase inhibitors that preserve or minimize changes in N-glycosylation activities in the cell lysate. Such inhibitors include, for example, chelators such as ethylenediamine tetraacetic acid (EDTA), ethylene glycol bis(P-aminoethyl ether) N,N,N1,N1-tetraacetic acid (EGTA), protease inhibitors such as phenylmethylsulfonyl fluoride (PMSF), aprotinin, leupeptin, antipain and the like, and phosphatase inhibitors such as phosphate, sodium fluoride, vanadate and the like. Appropriate buffers and conditions for obtaining lysates containing enzymatic activities are described in, e.g., Ausubel et al. Current Protocols in Molecular Biology (Supplement 47), John Wiley & Sons, New York (1999); Harlow and Lane, Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory Press (1988); Harlow and Lane, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Press (1999); Tietz Textbook of Clinical Chemistry, 3rd ed. Burtis and Ashwood, eds. W.B. Saunders, Philadelphia, (1999).

A cell lysate can be further processed to eliminate or minimize the presence of interfering substances, as appropriate. If desired, a cell lysate can be fractionated by a variety of methods well known to those skilled in the art, including subcellular fractionation, and chromatographic techniques such as ion exchange, hydrophobic and reverse phase, size exclusion, affinity, hydrophobic charge-induction chromatography, and the like.

In some embodiments, a cell lysate can be prepared in which whole cellular organelles remain intact and/or functional. For example, a lysate can contain one or more of intact rough endoplasmic reticulum, intact smooth endoplasmic reticulum, or intact Golgi apparatus. Suitable methods for preparing lysates containing intact cellular organelles and testing for the functionality of the organelles are described in, e.g., Moreau et al. (1991) J. Biol. Chem. 266(7):4329-4333; Moreau et al. (1991) J. Biol. Chem. 266(7):4322-4328; Rexach et al. (1991) J. Cell Biol. 114(2):219-229; and Paulik et al. (1999) Arch. Biochem. Biophys. 367(2):265-273.

Upon contact of a mammalian cell with a molecular complex containing uncapped and demannosylated phosphorylated N-glycans, the molecular complex can be transported to the interior of the mammalian cell (e.g., a human cell). A molecular complex having an uncapped, but not demannosylated, phosphorylated N-glycan does not substantially bind mannose-6-phosphate receptors on mammalian cells, and as such, is not efficiently transported to the interior of the cell. As used herein, “does not substantially bind” means that less than 15% (e.g., less than 14%, 12%, 10%, 8%, 6%, 4%, 2%, 1%, 0.5%, or less, or 0%) of the glycoprotein molecules bind to mannose-6-phosphate receptors on mammalian cells. However, if such a molecular complex is contacted with a mannosidase capable of hydrolyzing a terminal alpha-1,2 mannose linkage or moiety when the underlying mannose is phosphorylated, a demannosylated glycoprotein is produced that substantially binds to the mannose-6-phosphate receptor on the mammalian cells and is efficiently transported to the interior of the cell. As used herein “substantially binds” means that 15% or more (e.g., greater than 16%, 18%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 95%) of the molecular complex binds to mannose-6-phosphate receptors on mammalian cells. It is understood that a preparation (e.g., a recombinant host cell or a cell-free preparation) containing an enzyme that uncaps but does not demannosylate phosphorylated N-glycans could be contaminated with an enzyme that demannosylates phosphorylated N-glycans. A target protein sample after contact with such a preparation can contain protein molecules with some phosphorylated N-glycans that are uncapped only and others that are uncapped and demannosylated. Naturally those protein molecules containing uncapped and demannosylated phosphorylated N-glycans can substantially bind to mannose-6-phosphate receptors. The above definition of “does not substantially bind” does not apply to such a target protein sample since the phosphorylated N-glycans on the protein molecules cannot be characterized as uncapped but not demannosylated.

Thus, this document provides methods of converting a molecular complex from a first form that does not bind to a mannose-6-phosphate receptor on a mammalian cell to a second form that does bind to a mannose-6-phosphate receptor on a mammalian cell. In the first form, the molecular complex in which at least one of the polypeptides in the complex comprises one or more N-glycans containing one or more mannose residues that are linked at the 1 position to a mannose residue that contains a phosphate residue at the 6 position. In such methods, the first form of the molecular complex is contacted with a mannosidase that demannosylates the terminal mannose residues to result in the mannose containing the phosphate at the 6 position to become the terminal mannose. In some embodiments, the mannosidase has both uncapping and demannosylating activity (e.g., Canavalia ensiformis (Jack bean) or Yarrowia lipolytica AMS1 mannosidase). In some embodiments, the mannosidase does not have uncapping activity (e.g., a mannosidase from Aspergillus satoi or a mannosidase from Cellulosimicrobium cellulans (e.g., CcMan4)).

Transport of a glycoprotein or molecular complex to the interior of the cell can be assessed using a cell uptake assay. For example, mammalian cells and a molecular complex containing uncapped and demannosylated phosphorylated N-glycans can be incubated, then the cells washed and lysed. Cell lysates can be assessed for the presence of the GAA complex (e.g., by Western blotting) or by activity of GAA in the cell lysate. For example, uptake can be assessed in fibroblasts deficient in acid alpha glucosidase activity. Intracellular activity of alpha glucosidase can be assessed using the 4-methylumbelliferyl-alpha-D-glucopyranoside (4-MUG) assay. Cleavage of the substrate 4-MUG by a glucosidase leads to the generation of the fluorigenic product 4-MU, which can be visualized or detected by irradiation with UV light.

Recombinant Production of Polypeptides

Isolated nucleic acid molecules encoding polypeptides (e.g., a mannosidase, an alkaline protease, or GAA or a fragment thereof) can be produced by standard techniques. The terms “nucleic acid” and “polynucleotide” are used interchangeably herein, and refer to both RNA and DNA, including cDNA, genomic DNA, synthetic DNA, and DNA (or RNA) containing nucleic acid analogs. Polynucleotides can have any three-dimensional structure. A nucleic acid can be double-stranded or single-stranded (i.e., a sense strand or an antisense strand). Non-limiting examples of polynucleotides include genes, gene fragments, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, siRNA, micro-RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers, as well as nucleic acid analogs.

An “isolated nucleic acid” refers to a nucleic acid that is separated from other nucleic acid molecules that are present in a naturally-occurring genome, including nucleic acids that normally flank one or both sides of the nucleic acid in a naturally-occurring genome (e.g., a yeast genome). The term “isolated” as used herein with respect to nucleic acids also includes any non-naturally-occurring nucleic acid sequence, since such non-naturally-occurring sequences are not found in nature and do not have immediately contiguous sequences in a naturally-occurring genome.

An isolated nucleic acid can be, for example, a DNA molecule, provided one of the nucleic acid sequences normally found immediately flanking that DNA molecule in a naturally-occurring genome is removed or absent. Thus, an isolated nucleic acid includes, without limitation, a DNA molecule that exists as a separate molecule (e.g., a chemically synthesized nucleic acid, or a cDNA or genomic DNA fragment produced by PCR or restriction endonuclease treatment) independent of other sequences as well as DNA that is incorporated into a vector, an autonomously replicating plasmid, a virus (e.g., any paramyxovirus, retrovirus, lentivirus, adenovirus, or herpes virus), or into the genomic DNA of a prokaryote or eukaryote. In addition, an isolated nucleic acid can include an engineered nucleic acid such as a DNA molecule that is part of a hybrid or fusion nucleic acid. A nucleic acid existing among hundreds to millions of other nucleic acids within, for example, cDNA libraries or genomic libraries, or gel slices containing a genomic DNA restriction digest, is not considered an isolated nucleic acid.

The term “exogenous” as used herein with reference to nucleic acid and a particular host cell refers to any nucleic acid that does not occur in (and cannot be obtained from) that particular cell as found in nature. Thus, a non-naturally-occurring nucleic acid is considered to be exogenous to a host cell once introduced into the host cell. It is important to note that non-naturally-occurring nucleic acids can contain nucleic acid subsequences or fragments of nucleic acid sequences that are found in nature provided that the nucleic acid as a whole does not exist in nature. For example, a nucleic acid molecule containing a genomic DNA sequence within an expression vector is non-naturally-occurring nucleic acid, and thus is exogenous to a host cell once introduced into the host cell, since that nucleic acid molecule as a whole (genomic DNA plus vector DNA) does not exist in nature. Thus, any vector, autonomously replicating plasmid, or virus (e.g., retrovirus, adenovirus, or herpes virus) that as a whole does not exist in nature is considered to be non-naturally-occurring nucleic acid. It follows that genomic DNA fragments produced by PCR or restriction endonuclease treatment as well as cDNAs are considered to be non-naturally-occurring nucleic acid since they exist as separate molecules not found in nature. It also follows that any nucleic acid containing a promoter sequence and polypeptide-encoding sequence (e.g., cDNA or genomic DNA) in an arrangement not found in nature is non-naturally-occurring nucleic acid. A nucleic acid that is naturally-occurring can be exogenous to a particular cell. For example, an entire chromosome isolated from a cell of yeast x is an exogenous nucleic acid with respect to a cell of yeast y once that chromosome is introduced into a cell of yeast y.

Polymerase chain reaction (PCR) techniques can be used to obtain an isolated nucleic acid containing a nucleotide sequence described herein. PCR can be used to amplify specific sequences from DNA as well as RNA, including sequences from total genomic DNA or total cellular RNA. Generally, sequence information from the ends of the region of interest or beyond is employed to design oligonucleotide primers that are identical or similar in sequence to opposite strands of the template to be amplified. Various PCR strategies also are available by which site-specific nucleotide sequence modifications can be introduced into a template nucleic acid. Isolated nucleic acids also can be chemically synthesized, either as a single nucleic acid molecule (e.g., using automated DNA synthesis in the 3′ to 5′ direction using phosphoramidite technology) or as a series of oligonucleotides. For example, one or more pairs of long oligonucleotides (e.g., >100 nucleotides) can be synthesized that contain the desired sequence, with each pair containing a short segment of complementarity (e.g., about 15 nucleotides) such that a duplex is formed when the oligonucleotide pair is annealed. DNA polymerase is used to extend the oligonucleotides, resulting in a single, double-stranded nucleic acid molecule per oligonucleotide pair, which then can be ligated into a vector. Isolated nucleic acids also can be obtained by mutagenesis of, e.g., a naturally occurring DNA.

To recombinantly produce a polypeptide (e.g., a mannosidase, an alkaline protease, or GAA or fragment thereof), a vector is used that contains a promoter operably linked to nucleic acid encoding the polypeptide. As used herein, a “promoter” refers to a DNA sequence that enables a gene to be transcribed. The promoter is recognized by RNA polymerase, which then initiates transcription. Thus, a promoter contains a DNA sequence that is either bound directly by, or is involved in the recruitment, of RNA polymerase. A promoter sequence can also include “enhancer regions,” which are one or more regions of DNA that can be bound with proteins (namely, the trans-acting factors, much like a set of transcription factors) to enhance transcription levels of genes (hence the name) in a gene-cluster. The enhancer, while typically at the 5′ end of a coding region, can also be separate from a promoter sequence and can be, e.g., within an intronic region of a gene or 3′ to the coding region of the gene.

As used herein, “operably linked” means incorporated into a genetic construct (e.g., vector) so that expression control sequences effectively control expression of a coding sequence of interest.

Expression vectors can be introduced into host cells (e.g., by transformation or transfection) for expression of the encoded polypeptide, which then can be purified. Expression systems that can be used for small or large scale production of polypeptides (e.g., a mannosidase, alkaline protease, or GAA or fragment thereof) include, without limitation, microorganisms such as bacteria (e.g., E. coli) transformed with recombinant bacteriophage DNA, plasmid DNA, or cosmid DNA expression vectors containing the nucleic acid molecules, and fungal (e.g., Yarrowia lipolytica, Arxula adeninivorans, Pichia pastoris, Hansenula polymorphs, Ogataea minuta, Pichia methanolica, Aspergillus niger, Trichoderma reesei, and Saccharomyces cerevisiae) transformed with recombinant fungal expression vectors containing the nucleic acid molecules. Useful expression systems also include insect cell systems infected with recombinant virus expression vectors (e.g., baculovirus) containing the nucleic acid molecules, and plant cell systems infected with recombinant virus expression vectors (e.g., tobacco mosaic virus) or transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) containing the nucleic acid molecules. Mannosidase or alkaline protease polypeptides also can be produced using mammalian expression systems, which include cells (e.g., immortalized cell lines such as COS cells, Chinese hamster ovary cells, HeLa cells, human embryonic kidney 293 cells, and 3T3 L1 cells) harboring recombinant expression constructs containing promoters derived from the genome of mammalian cells (e.g., the metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter and the cytomegalovirus promoter).

Recombinant polypeptides such as a mannosidase can be tagged with a heterologous amino acid sequence such FLAG, polyhistidine (e.g., hexahistidine), hemagluttanin (HA), glutathione-S-transferase (GST), or maltose-binding protein (MBP) to aid in purifying the protein. Other methods for purifying proteins include chromatographic techniques such as ion exchange, hydrophobic and reverse phase, size exclusion, affinity, hydrophobic charge-induction chromatography, and the like (see, e.g., Scopes, Protein Purification: Principles and Practice, third edition, Springer-Verlag, New York (1993); Burton and Harding, J. Chromatogr. A 814:71-81 (1998)).

In Vivo Methods of Uncapping and Demannosylating Glycoproteins

Genetically engineered cells described herein can be used to produce molecular complexes having GAA activity. For example, genetically engineered cells can be used to produce molecule complexes having GAA activity and comprising at least two polypeptides, each polypeptide having at least 85% sequence identity to a segment of the amino acid sequence set forth in SEQ ID NO: 1, each segment being derived by proteolysis of the amino acid sequence set forth in SEQ ID NO: 1 at one or more sites between amino acid 50 and amino acid 74 (e.g., between amino acid 56 and amino acid 68 or between amino acid 60 and amino acid 65). For example, a fungal cell can be engineered to include a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO:1 and a nucleic acid encoding an alkaline protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8 such that each of the encoded polypeptides are secreted into the culture medium, where the alkaline protease can cleave the amino acid sequence set forth in SEQ ID NO:1. As described in Example 12, when the recombinant GAA was secreted into the culture medium with the alkaline protease, processing of the 110 kDa precursor to the 95 kDa form was complete, i.e., the 110 kDa precursor was not detected.

Genetically engineered cells described herein also can be used to produce uncapped and demannosylated molecular complexes. Such genetically engineered cells can include a nucleic acid encoding a polypeptide having the amino acid sequence set forth in SEQ ID NO:1, a nucleic acid encoding a mannosidase as described herein, and optionally a nucleic acid encoding an alkaline protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8.

A cell based method of producing uncapped and demannosylated molecule complexes can include introducing into a fungal cell genetically engineered to include a nucleic acid encoding a mannosidase that is capable of hydrolyzing a mannose-1-phospho-6-mannose linkage or moiety to phospho-6-mannose, a nucleic acid encoding a polypeptide having the amino acid sequence set forth in SEQ ID NO: 1 and optionally a nucleic acid encoding an alkaline protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8, wherein the cell produces the molecular complex described herein containing uncapped phosphorylated N-glycans. Such phosphorylated N-glycans can be demannosylated as described above. In some embodiments, the nucleic acids encoding the mannosidase and GAA contain a secretion sequence such that the mannosidase and GAA are co-secreted. In genetically engineered cells that include a nucleic acid encoding an alkaline protease, the molecular complexes can be processed to the 95 kDa form.

Another cell based method can include the steps of introducing into a fungal cell genetically engineered to include a nucleic acid encoding a mannosidase that is capable of (i) hydrolyzing a mannose-1-phospho-6-mannose linkage or moiety to phospho-6-mannose and (ii) hydrolyzing a terminal alpha-1,2 mannose, alpha-1,3 mannose and/or alpha-1,6 mannose linkage or moiety of a phosphate containing glycan, a nucleic acid encoding a polypeptide having the amino acid sequence set forth in SEQ ID NO: 1, and optionally a nucleic acid encoding an alkaline protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8, wherein the cell produces uncapped and demannosylated molecular complexes. In some embodiments, the nucleic acids encoding the mannosidase and GAA contain a secretion sequence such that the mannosidase and target molecule are co-secreted. In genetically engineered cells that include a nucleic acid encoding an alkaline protease, the molecular complexes can be processed to the 95 kDa form.

Cells suitable for in vivo production of target molecules or molecular complexes can be of fungal origin, including Yarrowia lipolytica, Arxula adeninivorans, methylotrophic yeast (such as a methylotrophic yeast of the genus Candida, Hansenula, Oogataea, Pichia or Torulopsis) or filamentous fungi of the genus Aspergillus, Trichoderma, Neurospora, Fusarium, or Chrysosporium. Exemplary fungal species include, without limitation, Pichia anomala, Pichia bovis, Pichia canadensis, Pichia carsonii, Pichia farinose, Pichia fermentans, Pichia fluxuum, Pichia membranaefaciens, Pichia membranaefaciens, Candida valida, Candida albicans, Candida ascalaphidarum, Candida amphixiae, Candida Antarctica, Candida atlantica, Candida atmosphaerica, Candida blattae, Candida carpophila, Candida cerambycidarum, Candida chauliodes, Candida corydalis, Candida dosseyi, Candida dubliniensis, Candida ergatensis, Candida fructus, Candida glabrata, Candida fermentati, Candida guilliermondii, Candida haemulonii, Candida insectamens, Candida insectorum, Candida intermedia, Candida jeffresii, Candida kefyr, Candida krusei, Candida lusitaniae, Candida lyxosophila, Candida maltosa, Candida membranifaciens, Candida milleri, Candida oleophila, Candida oregonensis, Candida parapsilosis, Candida quercitrusa, Candida shehatea, Candida temnochilae, Candida tenuis, Candida tropicalis, Candida tsuchiyae, Candida sinolaborantium, Candida sojae, Candida viswanathii, Candida utilis, Oogataea minuta, Pichia membranaefaciens, Pichia silvestris, Pichia membranaefaciens, Pichia chodati, Pichia membranaefaciens, Pichia menbranaefaciens, Pichia minuscule, Pichia pastoris, Pichia pseudopolymorpha, Pichia quercuum, Pichia robertsii, Pichia saitoi, Pichia silvestrisi, Pichia strasburgensis, Pichia terricola, Pichia vanriji, Pseudozyma Antarctica, Rhodosporidium toruloides, Rhodotorula glutinis, Saccharomyces bayanus, Saccharomyces bayanus, Saccharomyces momdshuricus, Saccharomyces uvarum, Saccharomyces bayanus, Saccharomyces cerevisiae, Saccharomyces bisporus, Saccharomyces chevalieri, Saccharomyces delbrueckii, Saccharomyces exiguous, Saccharomyces fermentati, Saccharomyces fragilis, Saccharomyces marxianus, Saccharomyces mellis, Saccharomyces rosei, Saccharomyces rouxii, Saccharomyces uvarum, Saccharomyces willianus, Saccharomycodes ludwigii, Saccharomycopsis capsularis, Saccharomycopsis fibuligera, Saccharomycopsis fibuligera, Endomyces hordei, Endomycopsis fobuligera. Saturnispora saitoi, Schizosaccharomyces octosporus, Schizosaccharomyces pombe, Schwanniomyces occidentalis, Torulaspora delbrueckii, Torulaspora delbrueckii, Saccharomyces dairensis, Torulaspora delbrueckii, Torulaspora fermentati, Saccharomyces fermentati, Torulaspora delbrueckii, Torulaspora rosei, Saccharomyces rosei, Torulaspora delbrueckii, Saccharomyces rosei, Torulaspora delbrueckii, Saccharomyces delbrueckii, Torulaspora delbrueckii, Saccharomyces delbrueckii, Zygosaccharomyces mongolicus, Dorulaspora globosa, Debaryomyces globosus, Torulopsis globosa, Trichosporon cutaneum, Trigonopsis variabilis, Williopsis californica, Williopsis saturnus, Zygosaccharomyces bisporus, Zygosaccharomyces bisporus, Debaryomyces disporua. Saccharomyces bisporas, Zygosaccharomyces bisporus, Saccharomyces bisporus, Zygosaccharomyces mellis, Zygosaccharomyces priorianus, Zygosaccharomyces rouxiim, Zygosaccharomyces rouxii, Zygosaccharomyces barkeri, Saccharomyces rouxii, Zygosaccharomyces rouxii, Zygosaccharomyces major, Saccharomyces rousii, Pichia anomala, Pichia bovis, Pichia Canadensis, Pichia carsonii, Pichia farinose, Pichia fermentans, Pichia fluxuum, Pichia membranaefaciens, Pichia pseudopolymorpha, Pichia quercuum, Pichia robertsii, Pseudozyma Antarctica, Rhodosporidium toruloides, Rhodosporidium toruloides, Rhodotorula glutinis, Saccharomyces bayanus, Saccharomyces bayanus, Saccharomyces bisporus, Saccharomyces cerevisiae, Saccharomyces chevalieri, Saccharomyces delbrueckii, Saccharomyces fermentati, Saccharomyces fragilis, Saccharomycodes ludwigii, Schizosaccharomyces pombe, Schwanniomyces occidentalis, Torulaspora delbrueckii, Torulaspora globosa, Trigonopsis variabilis, Williopsis californica, Williopsis saturnus, Zygosaccharomyces bisporus, Zygosaccharomyces mellis, or Zygosaccharomyces rouxii. Exemplary filamentous fungi include various species of Aspergillus including, but not limited to, Aspergillus caesiellus, Aspergillus candidus, Aspergillus carneus, Aspergillus clavatus, Aspergillus deflectus, Aspergillus flavus, Aspergillus fumigatus, Aspergillus glaucus, Aspergillus nidulans, Aspergillus niger, Aspergillus ochraceus, Aspergillus oryzae, Aspergillus parasiticus, Aspergillus penicilloides, Aspergillus restrictus, Aspergillus sojae, Aspergillus sydowi, Aspergillus tamari, Aspergillus terreus, Aspergillus ustus, or Aspergillus versicolor. Such cells, prior to the genetic engineering as specified herein, can be obtained from a variety of commercial sources and research resource facilities, such as, for example, the American Type Culture Collection (Rockville, Md.).

Genetic engineering of a cell can include, in addition to an exogenous nucleic acid encoding a mannosidase, GAA, and/or alkaline protease, one or more genetic modifications such as: (i) deletion of an endogenous gene encoding an Outer CHain elongation (OCH1) protein; (ii) introduction of a recombinant nucleic acid encoding a polypeptide capable of promoting mannosyl phosphorylation (e.g, a MNN4 polypeptide from Yarrowia lipolytica, S. cerevisiae, Ogataea minuta, Pichia pastoris, or C. albicans, or PNO1 polypeptide from P. pastoris) to increasing phosphorylation of mannose residues; (iii) introduction or expression of an RNA molecule that interferes with the functional expression of an OCH1 protein; (iv) introduction of a recombinant nucleic acid encoding a wild-type (e.g., endogenous or exogenous) protein having a N-glycosylation activity (i.e., expressing a protein having an N-glycosylation activity); or (v) altering the promoter or enhancer elements of one or more endogenous genes encoding proteins having N-glycosylation activity to thus alter the expression of their encoded proteins. RNA molecules include, e.g., small-interfering RNA (siRNA), short hairpin RNA (shRNA), anti-sense RNA, or micro RNA (miRNA). Genetic engineering also includes altering an endogenous gene encoding a protein having an N-glycosylation activity to produce a protein having additions (e.g., a heterologous sequence), deletions, or substitutions (e.g., mutations such as point mutations; conservative or non-conservative mutations). Mutations can be introduced specifically (e.g., by site-directed mutagenesis or homologous recombination) or can be introduced randomly (for example, cells can be chemically mutagenized as described in, e.g., Newman and Ferro-Novick (1987) J. Cell Biol. 105(4):1587.

Genetic modifications described herein can result in one or more of (i) an increase in one or more activities in the genetically modified cell, (ii) a decrease in one or more activities in the genetically modified cell, or (iii) a change in the localization or intracellular distribution of one or more activities in the genetically modified cell. It is understood that an increase in the amount of a particular activity (e.g., promoting mannosyl phosphorylation) can be due to overexpressing one or more proteins capable of promoting mannosyl phosphorylation, an increase in copy number of an endogenous gene (e.g., gene duplication), or an alteration in the promoter or enhancer of an endogenous gene that stimulates an increase in expression of the protein encoded by the gene. A decrease in one or more particular activities can be due to overexpression of a mutant form (e.g., a dominant negative form), introduction or expression of one or more interfering RNA molecules that reduce the expression of one or more proteins having a particular activity, or deletion of one or more endogenous genes that encode a protein having the particular activity.

To disrupt a gene by homologous recombination, a “gene replacement” vector can be constructed in such a way to include a selectable marker gene. The selectable marker gene can be operably linked, at both 5′ and 3′ end, to portions of the gene of sufficient length to mediate homologous recombination. The selectable marker can be one of any number of genes which either complement host cell auxotrophy or provide antibiotic resistance, including URA3, LEU2 and HIS3 genes. Other suitable selectable markers include the CAT gene, which confers chloramphenicol resistance to yeast cells, or the lacZ gene, which results in blue colonies due to the expression of β-galactosidase. Linearized DNA fragments of the gene replacement vector are then introduced into the cells using methods well known in the art (see below). Integration of the linear fragments into the genome and the disruption of the gene can be determined based on the selection marker and can be verified by, for example, Southern blot analysis. A selectable marker can be removed from the genome of the host cell by, e.g., Cre-loxP systems (see below).

Alternatively, a gene replacement vector can be constructed in such a way as to include a portion of the gene to be disrupted, which portion is devoid of any endogenous gene promoter sequence and encodes none or an inactive fragment of the coding sequence of the gene. An “inactive fragment” is a fragment of the gene that encodes a protein having, e.g., less than about 10% (e.g., less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, less than about 1%, or 0%) of the activity of the protein produced from the full-length coding sequence of the gene. Such a portion of the gene is inserted in a vector in such a way that no known promoter sequence is operably linked to the gene sequence, but that a stop codon and a transcription termination sequence are operably linked to the portion of the gene sequence. This vector can be subsequently linearized in the portion of the gene sequence and transformed into a cell. By way of single homologous recombination, this linearized vector is then integrated in the endogenous counterpart of the gene.

Expression vectors can be autonomous or integrative. A recombinant nucleic acid (e.g., one encoding a mannosidase, GAA, or alkaline protease) can be in introduced into the cell in the form of an expression vector such as a plasmid, phage, transposon, cosmid or virus particle. The recombinant nucleic acid can be maintained extrachromosomally or it can be integrated into the yeast cell chromosomal DNA. Expression vectors can contain selection marker genes encoding proteins required for cell viability under selected conditions (e.g., URA3, which encodes an enzyme necessary for uracil biosynthesis or TRP1, which encodes an enzyme required for tryptophan biosynthesis) to permit detection and/or selection of those cells transformed with the desired nucleic acids (see, e.g., U.S. Pat. No. 4,704,362). Expression vectors can also include an autonomous replication sequence (ARS). For example, U.S. Pat. No. 4,837,148 describes autonomous replication sequences which provide a suitable means for maintaining plasmids in Pichia pastoris.

Integrative vectors are disclosed, e.g., in U.S. Pat. No. 4,882,279. Integrative vectors generally include a serially arranged sequence of at least a first insertable DNA fragment, a selectable marker gene, and a second insertable DNA fragment. The first and second insertable DNA fragments are each about 200 (e.g., about 250, about 300, about 350, about 400, about 450, about 500, or about 1000 or more) nucleotides in length and have nucleotide sequences which are homologous to portions of the genomic DNA of the species to be transformed. A nucleotide sequence containing a gene of interest (e.g., a gene encoding GAA) for expression is inserted in this vector between the first and second insertable DNA fragments whether before or after the marker gene. Integrative vectors can be linearized prior to yeast transformation to facilitate the integration of the nucleotide sequence of interest into the host cell genome.

An expression vector can feature a recombinant nucleic acid under the control of a yeast (e.g., Yarrowia lipolytica, Arxula adeninivorans, P. pastoris, or other suitable fungal species) promoter, which enables them to be expressed in fungal cells. Suitable yeast promoters include, e.g., ADC1, TPI1, ADH2, hp4d, PDX, and Gal10 (see, e.g., Guarente et al. (1982) Proc. Natl. Acad. Sci. USA 79(23):7410) promoters. Additional suitable promoters are described in, e.g., Zhu and Zhang (1999) Bioinformatics 15(7-8):608-611 and U.S. Pat. No. 6,265,185.

A promoter can be constitutive or inducible (conditional). A constitutive promoter is understood to be a promoter whose expression is constant under the standard culturing conditions. Inducible promoters are promoters that are responsive to one or more induction cues. For example, an inducible promoter can be chemically regulated (e.g., a promoter whose transcriptional activity is regulated by the presence or absence of a chemical inducing agent such as an alcohol, tetracycline, a steroid, a metal, or other small molecule) or physically regulated (e.g., a promoter whose transcriptional activity is regulated by the presence or absence of a physical inducer such as light or high or low temperatures). An inducible promoter can also be indirectly regulated by one or more transcription factors that are themselves directly regulated by chemical or physical cues.

It is understood that other genetically engineered modifications can also be conditional. For example, a gene can be conditionally deleted using, e.g., a site-specific DNA recombinase such as the Cre-loxP system (see, e.g., Gossen et al. (2002) Ann. Rev. Genetics 36:153-173 and U.S. Application Publication No. 20060014264).

A recombinant nucleic acid can be introduced into a cell described herein using a variety of methods such as the spheroplast technique or the whole-cell lithium chloride yeast transformation method. Other methods useful for transformation of plasmids or linear nucleic acid vectors into cells are described in, for example, U.S. Pat. No. 4,929,555; Hinnen et al. (1978) Proc. Nat. Acad. Sci. USA 75:1929; Ito et al. (1983) J. Bacteriol. 153:163; U.S. Pat. No. 4,879,231; and Sreekrishna et al. (1987) Gene 59:115, the disclosures of each of which are incorporated herein by reference in their entirety. Electroporation and PEG1000 whole cell transformation procedures may also be used, as described by Cregg and Russel, Methods in Molecular Biology: Pichia Protocols, Chapter 3, Humana Press, Totowa, N.J., pp. 27-39 (1998).

Transformed fungal cells can be selected for by using appropriate techniques including, but not limited to, culturing auxotrophic cells after transformation in the absence of the biochemical product required (due to the cell's auxotrophy), selection for and detection of a new phenotype, or culturing in the presence of an antibiotic which is toxic to the yeast in the absence of a resistance gene contained in the transformants. Transformants can also be selected and/or verified by integration of the expression cassette into the genome, which can be assessed by, e.g., Southern blot or PCR analysis.

Prior to introducing the vectors into a target cell of interest, the vectors can be grown (e.g., amplified) in bacterial cells such as Escherichia coli (E. coli) as described above. The vector DNA can be isolated from bacterial cells by any of the methods known in the art which result in the purification of vector DNA from the bacterial milieu. The purified vector DNA can be extracted extensively with phenol, chloroform, and ether, to ensure that no E. coli proteins are present in the plasmid DNA preparation, since these proteins can be toxic to mammalian cells.

In some embodiments, the genetically engineered fungal cell lacks the OCH1 gene or gene products (e.g., mRNA or protein) thereof, and is deficient in OCH1 activity. In some embodiments, the genetically engineered cell expresses a polypeptide capable of promoting mannosyl phosphorylation (e.g., a MNN4 polypeptide from Yarrowia lipolytica, S. cerevisiae, Ogataea minuta, Pichia pastoris, or C. albicans, or a PNO1 polypeptide from P. pastoris). For example, the fungal cell can express a MNN4 polypeptide from Y. lipolytica (Genbank® Acccession Nos: XM_(—)503217, Genolevures Ref: YALI0D24101g). In some embodiments, the genetically engineered cell is deficient in OCH1 activity and expresses a polypeptide capable of promoting mannosyl phosphorylation.

Following uncapping and demannosylation, the molecular complex can be isolated. In some embodiments, the molecular complex is maintained within the yeast cell and released upon cell lysis. In some embodiments, the molecular complex is secreted into the culture medium via a mechanism provided by a coding sequence (either native to the exogenous nucleic acid or engineered into the expression vector), which directs secretion of the molecule from the cell. The presence of the uncapped and demannosylated molecular complex in the cell lysate or culture medium can be verified by a variety of standard protocols for detecting the presence of the molecule, e.g., immunoblotting or radioimmunoprecipitation with an antibody specific for GAA, or testing for a specific enzyme activity (e.g., glycogen degradation).

In some embodiments, following isolation, the uncapped and demannosylated target molecule or molecular complex can be attached to a heterologous moiety, e.g., using enzymatic or chemical means. A “heterologous moiety” refers to any constituent that is joined (e.g., covalently or non-covalently) to the altered target molecule, which constituent is different from a constituent originally present on the altered target molecule. Heterologous moieties include, e.g., polymers, carriers, adjuvants, immunotoxins, or detectable (e.g., fluorescent, luminescent, or radioactive) moieties. In some embodiments, an additional N-glycan can be added to the altered target molecule.

Methods for detecting glycosylation of molecules include DNA sequencer-assisted (DSA), fluorophore-assisted carbohydrate electrophoresis (FACE) or surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS). For example, an analysis can utilize DSA-FACE in which, for example, glycoproteins are denatured followed by immobilization on, e.g., a membrane. The glycoproteins can then be reduced with a suitable reducing agent such as dithiothreitol (DTT) or β-mercaptoethanol. The sulfhydryl groups of the proteins can be carboxylated using an acid such as iodoacetic acid. Next, the N-glycans can be released from the protein using an enzyme such as N-glycosidase F. N-glycans, optionally, can be reconstituted and derivatized by reductive amination. For example, the released N-glycans can be labeled with a fluorophore such APTS (8-aminopyrene-1,3,6-trisulfonic acid), at the reducing terminus by reductive amination. The stoichiometry of labeling is such that only one APTS molecule is attached to each molecule of oligosaccharide. The derivatized N-glycans can then be concentrated. Instrumentation suitable for N-glycan analysis includes, e.g., the ABI PRISM® 377 DNA sequencer (Applied Biosystems). Data analysis can be performed using, e.g., GENESCAN® 3.1 software (Applied Biosystems). Isolated mannoproteins can be further treated with one or more enzymes such as calf intestine phosphatase to confirm their N-glycan status. Additional methods of N-glycan analysis include, e.g., mass spectrometry (e.g., MALDI-TOF-MS), high-pressure liquid chromatography (HPLC) on normal phase, reversed phase and ion exchange chromatography (e.g., with pulsed amperometric detection when glycans are not labeled and with UV absorbance or fluorescence if glycans are appropriately labeled). See also Callewaert et al. (2001) Glycobiology 11(4):275-281 and Freire et al. (2006) Bioconjug. Chem. 17(2):559-564.

Cultures of Engineered Cells

This document also provides a substantially pure culture of any of the genetically engineered cells described herein. As used herein, a “substantially pure culture” of a genetically engineered cell is a culture of that cell in which less than about 40% (i.e., less than about: 35%; 30%; 25%; 20%; 15%; 10%; 5%; 2%; 1%; 0.5%; 0.25%; 0.1%; 0.01%; 0.001%; 0.0001%; or even less) of the total number of viable cells in the culture are viable cells other than the genetically engineered cell, e.g., bacterial, fungal (including yeast), mycoplasmal, or protozoan cells. The term “about” in this context means that the relevant percentage can be 15% percent of the specified percentage above or below the specified percentage. Thus, for example, about 20% can be 17% to 23%. Such a culture of genetically engineered cells includes the cells and a growth, storage, or transport medium. Media can be liquid, semi-solid (e.g., gelatinous media), or frozen. The culture includes the cells growing in the liquid or in/on the semi-solid medium or being stored or transported in a storage or transport medium, including a frozen storage or transport medium. The cultures are in a culture vessel or storage vessel or substrate (e.g., a culture dish, flask, or tube or a storage vial or tube).

The genetically engineered cells described herein can be stored, for example, as frozen cell suspensions, e.g., in buffer containing a cryoprotectant such as glycerol or sucrose, as lyophilized cells. Alternatively, they can be stored, for example, as dried cell preparations obtained, e.g., by fluidized bed drying or spray drying, or any other suitable drying method.

Pharmaceutical Compositions and Methods of Treatment

GAA molecules and molecular complexes described herein, e.g., molecular complexes containing at least one modification that enhances transport to the interior of a mammalian cell, can be incorporated into a pharmaceutical composition containing a therapeutically effective amount of the molecule and one or more adjuvants, excipients, carriers, and/or diluents. Acceptable diluents, carriers and excipients typically do not adversely affect a recipient's homeostasis (e.g., electrolyte balance). Acceptable carriers include biocompatible, inert or bioabsorbable salts, buffering agents, oligo- or polysaccharides, polymers, viscosity-improving agents, preservatives and the like. One exemplary carrier is physiologic saline (0.15 M NaCl, pH 7.0 to 7.4). Another exemplary carrier is 50 mM sodium phosphate, 100 mM sodium chloride. Further details on techniques for formulation and administration of pharmaceutical compositions can be found in, e.g., Remington's Pharmaceutical Sciences (Maack Publishing Co., Easton, Pa.). Supplementary active compounds can also be incorporated into the compositions.

Administration of a pharmaceutical composition containing molecular complexes with one or modifications described herein can be systemic or local. Pharmaceutical compositions can be formulated such that they are suitable for parenteral and/or non-parenteral administration. Specific administration modalities include subcutaneous, intravenous, intramuscular, intraperitoneal, transdermal, intrathecal, oral, rectal, buccal, topical, nasal, ophthalmic, intra-articular, intra-arterial, sub-arachnoid, bronchial, lymphatic, vaginal, and intra-uterine administration.

Administration can be by periodic injections of a bolus of the pharmaceutical composition or can be uninterrupted or continuous by intravenous or intraperitoneal administration from a reservoir which is external (e.g., an IV bag) or internal (e.g., a bioerodable implant, a bioartificial organ, or a colony of implanted altered N-glycosylation molecule production cells). See, e.g., U.S. Pat. Nos. 4,407,957, 5,798,113, and 5,800,828. Administration of a pharmaceutical composition can be achieved using suitable delivery means such as: a pump (see, e.g., Annals of Pharmacotherapy, 27:912 (1993); Cancer, 41:1270 (1993); Cancer Research, 44:1698 (1984); microencapsulation (see, e.g., U.S. Pat. Nos. 4,352,883; 4,353,888; and 5,084,350); continuous release polymer implants (see, e.g., Sabel, U.S. Pat. No. 4,883,666); macroencapsulation (see, e.g., U.S. Pat. Nos. 5,284,761, 5,158,881, 4,976,859 and 4,968,733 and published PCT patent applications WO92/19195, WO 95/05452); injection, either subcutaneously, intravenously, intra-arterially, intramuscularly, or to other suitable site; or oral administration, in capsule, liquid, tablet, pill, or prolonged release formulation.

Examples of parenteral delivery systems include ethylene-vinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, pump delivery, encapsulated cell delivery, liposomal delivery, needle-delivered injection, needle-less injection, nebulizer, aerosolizer, electroporation, and transdermal patch.

Formulations suitable for parenteral administration conveniently contain a sterile aqueous preparation of the altered N-glycosylation molecule, which preferably is isotonic with the blood of the recipient (e.g., physiological saline solution). Formulations can be presented in unit-dose or multi-dose form.

Formulations suitable for oral administration can be presented as discrete units such as capsules, cachets, tablets, or lozenges, each containing a predetermined amount of the altered N-glycosylation molecule; or a suspension in an aqueous liquor or a non-aqueous liquid, such as a syrup, an elixir, an emulsion, or a draught.

A molecular complex containing at least one modification that enhances transport of the complex to the interior of a mammalian cell that is suitable for topical administration can be administered to a mammal (e.g., a human patient) as, e.g., a cream, a spray, a foam, a gel, an ointment, a salve, or a dry rub. A dry rub can be rehydrated at the site of administration. Such molecules can also be infused directly into (e.g., soaked into and dried) a bandage, gauze, or patch, which can then be applied topically. Such molecules can also be maintained in a semi-liquid, gelled, or fully-liquid state in a bandage, gauze, or patch for topical administration (see, e.g., U.S. Pat. No. 4,307,717).

Therapeutically effective amounts of a pharmaceutical composition can be administered to a subject in need thereof in a dosage regimen ascertainable by one of skill in the art. For example, a composition can be administered to the subject, e.g., systemically at a dosage from 0.01 μg/kg to 10,000 μg/kg body weight of the subject, per dose. In another example, the dosage is from 1 μg/kg to 100 μg/kg body weight of the subject, per dose. In another example, the dosage is from 1 μg/kg to 30 μg/kg body weight of the subject, per dose, e.g., from 3 μg/kg to 10 μg/kg body weight of the subject, per dose.

In order to optimize therapeutic efficacy, a molecular complex described herein can be first administered at different dosing regimens. The unit dose and regimen depend on factors that include, e.g., the species of mammal, its immune status, the body weight of the mammal. Typically, levels of such a molecular complex in a tissue can be monitored using appropriate screening assays as part of a clinical testing procedure, e.g., to determine the efficacy of a given treatment regimen.

The frequency of dosing for a molecular complex described herein is within the skills and clinical judgement of medical practitioners (e.g., doctors or nurses). Typically, the administration regime is established by clinical trials which may establish optimal administration parameters. However, the practitioner may vary such administration regimes according to the subject's age, health, weight, sex and medical status. The frequency of dosing can be varied depending on whether the treatment is prophylactic or therapeutic.

Toxicity and therapeutic efficacy of such molecular complexes or pharmaceutical compositions thereof can be determined by known pharmaceutical procedures in, for example, cell cultures or experimental animals. These procedures can be used, e.g., for determining the LD₅₀ (the dose lethal to 50% of the population) and the ED₅₀ (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio LD₅₀/ED₅₀. Pharmaceutical compositions that exhibit high therapeutic indices are preferred. While pharmaceutical compositions that exhibit toxic side effects can be used, care should be taken to design a delivery system that targets such compounds to the site of affected tissue in order to minimize potential damage to normal cells (e.g., non-target cells) and, thereby, reduce side effects.

The data obtained from the cell culture assays and animal studies can be used in formulating a range of dosage for use in appropriate subjects (e.g., human patients). The dosage of such pharmaceutical compositions lies generally within a range of circulating concentrations that include the ED₅₀ with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. For a pharmaceutical composition used as described herein (e.g., for treating a metabolic disorder in a subject), the therapeutically effective dose can be estimated initially from cell culture assays. A dose can be formulated in animal models to achieve a circulating plasma concentration range that includes the IC₅₀ (i.e., the concentration of the pharmaceutical composition which achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be used to more accurately determine useful doses in humans. Levels in plasma can be measured, for example, by high performance liquid chromatography.

As defined herein, a “therapeutically effective amount” of a molecular complex is an amount of the complex that is capable of producing a medically desirable result (e.g., amelioration of one or more symptoms of Pompe's disease) in a treated subject. A therapeutically effective amount (i.e., an effective dosage) can includes milligram or microgram amounts of the complex per kilogram of subject or sample weight (e.g., about 1 microgram per kilogram to about 500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram).

The subject can be any mammal, e.g., a human (e.g., a human patient) or a non-human primate (e.g., chimpanzee, baboon, or monkey), a mouse, a rat, a rabbit, a guinea pig, a gerbil, a hamster, a horse, a type of livestock (e.g., cow, pig, sheep, or goat), a dog, a cat, or a whale.

A molecular complex or pharmaceutical composition thereof described herein can be administered to a subject as a combination therapy with another treatment used for Pompe's disease. For example, the combination therapy can include administering to the subject (e.g., a human patient) one or more additional agents that provide a therapeutic benefit to the subject who has, or is at risk of developing (e.g., due to a mutation in the gene encoding GAA) Pompe's disease. Thus, the compound or pharmaceutical composition and the one or more additional agents can be administered at the same time. Alternatively, the molecular complex can be administered first and the one or more additional agents administered second, or vice versa.

Any of the molecular complexes described herein can be lyophilized.

Any of the pharmaceutical compositions described herein can be included in a container, pack, or dispenser together with instructions for administration. In some embodiments, the composition is packaged as a single use vial.

The following are examples of the practice of the invention. They are not to be construed as limiting the scope of the invention in any way.

EXAMPLES Example 1 Uncapping and De-Mannosylation of Recombinant huGAA with CcMan5 and Jack Bean α-Mannosidase

Recombinant human GAA (rhGAA) was produced as described in WO2011/039634 using Y. lipolytica production strain OXYY1589, which contains three copies of the human alpha glucosidase gene (also known as acid alpha glucosidase or acid maltase EC3.2.1.3) and two copies of the Y. lipolytica MNN4 gene. The amino acid sequence of human GAA is set forth in FIG. 1. The genotype of strain OXY1589 is as follows:

-   MatA, leu2-958, ura3-302, xpr2-322, -   gut2-744, ade2-844 -   POX2-Lip2pre-huGAA:URA3Ex::zeta -   POX2-Lip2pre-huGAA:LEU2Ex::zeta -   POX2-Lip2pre-hGM-CSF: GUTEx::zeta -   Y1MNN4-POX2-hp4d-YLMNN4:ADE2::PT targeted

RhGAA was uncapped and demannosylated with Cellulosimicrobium cellulans mannosidase (CcMan5) and Jack bean a mannosidase (JbMan) (Sigma Product M7257, 3.0 M ammonium sulphate suspension). CcMan5 was produced recombinantly by first cloning the nucleic acid encoding the CcMan5 polypeptide (FIG. 2A) into vector pLSAH36, which contains a DsbA signal sequence and results in the expression of a protein with an N-terminal HIS tag. FIGS. 2B and 2C contain the amino acid sequence of the CcMan5 polypeptide with and without signal sequence, respectively. Plasmid pLSAH36 was cloned into E. coli B21 cells and proteins residing in the periplasm were isolated and purified using a Talon column. Before using the ammonium sulphate suspension of JbMan, it was further purified by gel filtration through a Superdex 200 column to remove contaminating phosphatase activities.

RhGAA (concentration of about 5 mg/mL in 20 mM sodium acetate (NaOAc) buffer, pH 5.0) was uncapped and demannosylated by incubating with CcMan5 (about 0.15-0.30 mg/mL in phosphate buffered saline (PBS)) and JbMan (about 0.5-1 mg/mL in PBS) in a w:w ratio of 100:5:10 for huGAA:CcMan5:JbMan. The total reaction volume was diluted with 500 mM NaOAc buffer, pH 5.0 and 100 mM CaCl₂ to obtain final concentrations of 100 mM NaOAc and 2 mM CaCl₂. The reaction mixture was incubated at 30° C. for 16 hours.

To evaluate the uncapping process and to analyze the N-glycan profile of the purified huGAA, the N-glycans of 5 μg glycoprotein were released with N-Glycosidase F (PNGaseF), labeled with APTS (8-amino-1,3,6-pyrenetrisulfonic acid; trisodium salt) and subsequently analyzed on DSA-FACE (DNA Sequencer-Aided Fluorophore-Assisted Carbohydrate Electrophoresis). The method essentially follows the protocol described in Laroy et al, Nature Protocols, 1:397-405 (2006).

The DSA-FACE electropherograms of the N-glycans from huGAA (76 kD form) before (panel B) and after (panel C) treatment with CcMan5 and JbMan are presented in FIG. 3A. Panel A is a reference sample containing the N-glycans released from RNaseB with PNGaseF. The N-glycan mixture released from capped huGAA is mainly composed of ManP-Man₈GlcNAc₂ and (ManP)₂-Man₈GlcNAc₂ (FIG. 3A, panel B). A peak running slightly faster than ManP-Man₈GlcNAc₂ can be assigned to ManP-Man₇GlcNAc₂. The main peaks observed after uncapping and demannosylation can be assigned to the double phosphorylated P₂-Man₆GlcNAc₂ and the monophosphorylated P-Man₄GlcNAc₂, P-Man₅GlcNAc₂ and P-Man₆GlcNAc₂ (Panel C).

The uncapping of different processed forms of huGAA results in the same N-glycan profiles (FIG. 3B) for the 76 kD form (Panel B), 95 kD form (Panel C) and 110 kD form (Panel D).

Example 2 Purification of 110 kDa rhGAA

The 110 kDa form of rhGAA was isolated from strain OXYY1589 as follows. After harvest, the broth was centrifuged and filtered using a Durapore membrane (Merck Millipore). Ammonium sulphate (AMS) was added to a concentration of 1 M and the solute was filtered before loading on a hydrophobic interaction chromatography (HIC) column, equilibrated in 20 mM sodium phosphate pH 6, 1 M ammonium sulphate. The product was eluted with 20 mM sodium phosphate pH 6.

Before loading on a second chromatography column, the product was first concentrated via tangential flow filtration (TFF) on a regenerated cellulose membrane, then exchanged from buffer to 20 mM sodium acetate pH 4.5. This material was loaded on a cation exchange chromatography (CEX) column, equilibrated with 20 mM sodium acetate pH 4.5. After loading the column, it was washed with equilibration buffer until the UV absorbance signal reached baseline, and then washed with 20 mM sodium acetate pH 4.5, 50 mM NaCl. The product was eluted in 20 mM sodium acetate pH 4.5, 150 mM NaCl, and concentrated and exchanged from buffer to 20 mM sodium acetate pH 5. (See below)

The sample was uncapped and demannosylated as described in Example 1 then D-mannitol was added to a concentration of 100 mM. Three quarters of that material was reduced in volume via TFF using a regenerated cellulose membrane having a 10 kDa molecular weight cut off (MWCO) and purified further via size exclusion chromatography (SEC) on a Superdex 200 column equilibrated at 4° C. with 25 mM sodium phosphate pH 6, 150 mM NaCl, 100 mM D-mannitol. Fractions were screened afterwards for purity on cibacron-blue stained polyacrylamide gels under denaturating conditions. Pooled fractions were concentrated via TFF and ultracentrifuged using Amicon centrifugal filters of 10 kD MWCO (regenerated cellulose membrane, Merck Millipore).

Example 3 Purification of 110 kDa rhGAA

The 110 kDa form of rhGAA was isolated from strain OXYY1589 as follows. After harvest, the material was centrifuged and filtered before the concentration of AMS was increased to 1 M. The solute was again filtered and the product was captured on a HIC column, equilibrated with 20 mM sodium phosphate pH 6, 1 M AMS, and released in a step gradient from 1 to 0 M AMS in a 20 mM sodium phosphate pH 6 buffer.

The eluate was concentrated and buffer exchanged to 10 mM BIS-TRIS, pH 6 via TFF on a Vivaflow 200 module (PES membrane, 10 kD MWCO, Sartorius). The desalted material was brought onto an anion exchange chromatography (AEC) column. After washing of the column until the UV signal almost reached baseline, a two-phase continuous salt gradient was applied; the first gradient going from 0 to 0.3 M NaCl, the second from 0.3 to 1 M NaCl. Fractions were collected during the gradient and screened for GAA via a qualitative 4-methylumbellifferyl-α-D-glucopyranoside (4-MUG). In the 4-MUG assay, reactions were started by adding a reaction buffer consisting of 0.35 mM 4-MUG, 0.1% BSA and 100 mM sodium acetate pH 4 in a 10:1 volume proportion to 10 μl of elution fraction. After incubating for 30 minutes to 1 hour at 37° C., an equal volume of 100 mM glycine pH 11 was added to stop the reaction. The release of the fluorogenic reaction product 4-methylumbelliferone was observed under UV-light.

Fractions containing GAA were pooled and concentrated via TFF on a Vivaflow 200 module (PES membrane, 10 kD MWCO, Sartorius) and ultracentrifugation using Amicon centrifugal filters of 10 kD MWCO (regenerated cellulose membrane, Merck Millipore).

The concentrated material was split in two and purified further on a Superdex 200 column equilibrated at 4° C. with 50 mM sodium phosphate pH 6, 250 mM NaCl, 0.05% Tween-20. Fractions were screened afterwards for purity on cibacron-blue stained polyacrylamide gels under denaturing conditions, and phosphatase content was determined using a colormetric test using para-nitrophenylphosphate, which measures the enzymatic release of the yellow colored p-nitrophenolate reaction product at a wavelength of 405 nm.

Pilot pools were made from fractions containing GAA. The total protein of the pilot pools was determined via the Bradford assay. Selected fractions were pooled for concentration onto a Vivaflow 200 TFF module (PES membrane, 10 kD MWCO, Sartorius). The volume was further reduced using 15 ml Amicon centrifugal filters of 10 kD MWCO (regenerated cellulose membrane, Merck Millipore).

The concentrated material was subjected to a second round of size exclusion chromatography (SEC) using the same conditions as for the first SEC step. Fractions were again screened for purity on cibacron-blue stained polyacrylamide gels under denaturing conditions. Fractions were pooled according to the chosen pilot pool and concentrated on 15 ml Amicon centrifugal filters (10 kD MWCO, regenerated cellulose membrane, Merck Millipore).

Example 4 Purification of 95 kDa rhGAA

The 95 kDa form of rhGAA was isolated from strain OXYY1589 as follows.

After harvest, the broth was centrifuged and filtered using a Durapore membrane (Merck Millipore). The product was afterwards concentrated via TFF on a modified polyethersulfone (PES) membrane with a molecular-weight-cut-off (MWCO) of 10 kD. AMS was added to a concentration of 1 M and the solute was filtered before loading on a HIC column, equilibrated in 20 mM sodium phosphate pH 6, 1 M AMS. The product was eluted with water, the pH of the eluate was adjusted by adding a stock buffer of 100 mM BIS-TRIS pH 6 to a concentration of 10 mM, and it was stored at 4° C. for 13 days.

Before loading on an AEX column, the product was concentrated via TFF on a regenerated cellulose membrane with an MWCO of 10 kD and buffer-exchanged to 10 mM BIS-TRIS pH 6. The desalted material was processed further via AEX chromatography, performed as described in Example 3. Fractions were collected during the gradient and screened for GAA via the qualitative 4-MUG assay. Fractions containing GAA were pooled for further purification.

For the third chromatography step, the concentration of AMS was increased to 1 M, and, after filtration, the material was further purified via HIC. A continuous salt gradient from 1 to 0 M AMS was applied while collecting fractions during the gradient. All fractions were screened for GAA via the qualitative assay and those containing GAA were pooled for further processing.

The pool was concentrated via ultra-centrifugation using 15 ml Amicon centrifugal filters of 10 kD MWCO regenerated cellulose membrane and further purified via SEC using the same procedures as described in Example 3. Fractions were screened afterwards for purity on cibacron-blue stained polyacrylamide gels under denaturing conditions. The 90% pure GAA fractions were pooled and first concentrated on a TFF Vivaflow 200 module (PES membrane, 10 kD MWCO, Sartorius), and then subjected to ultra-centrifugation using 15 ml Amicon centrifugal filters (10 kD MWCO, regenerated cellulose membrane, Merck Millipore). The concentrated material was subjected to a second round of SEC using the same conditions as for the first SEC step. Fractions were screened for GAA using the qualitative 4-MUG GAA assay. Fractions having GAA activity were pooled and concentrated.

After uncapping and demannosylation as set forth in Example 1, D-mannitol was added to a concentration of 100 mM and the volume was again reduced before loading onto a final Superdex 200 gel filtration column, equilibrated at 4° C. with 25 mM sodium phosphate pH 6, 150 mM NaCl, 100 mM D-mannitol. Fractions were screened for GAA using the 4-MUG qualitative assay, and those containing the product were pooled and concentrated.

Example 5 Purification of 95-110 kDa rhGAA Mix

Both the 110 kDa precursor and 95 kDa form of rhGAA was isolated from strain OXYY1589 as follows. After harvest, the material was processed to the second chromatography step as described in Example 2. After the HIC step, the product was concentrated and the buffer exchanged to 10 mM BIS-TRIS pH 6 via TFF, and loaded on an AEX column. The product was eluted in a single step from 0 to 300 mM NaCl at pH 6 (10 mM BIS-TRIS) and then concentrated using a Pellicon XL50 TFF module (regenerated cellulose membrane with a 10 kD MWCO). Half of the material was further purified via size exclusion chromatography. The chromatography step was performed as described in Example 3, but the selection of the fractions for further processing was only done on the basis of purity on cibacron-blue stained polyacrylamide gels under denaturing conditions.

Half of the pool was concentrated and combined with the remainder of the AEX-material. After uncapping and demannosylation, the concentration of D-mannitol was increased to 100 mM and the subsequent concentration and SEC steps were done following the same procedures as described in Example 2. Fractions were pooled on the basis of the 4-MUG qualitative assay and pooled with uncapped product from Example 6.

Example 6 Purification of 95 kDa rhGAA

The 95 kDa form of rhGAA was isolated from strain OXYY1589 as follows. After harvest, the material was processed up to and including the AEX step as described in Example 3. In the AEX step, a significant amount of the product resided in the flow through fraction due to an increase of conductivity during the loading. The flow through material was therefore again diafiltered to the appropriate buffer and subjected to a second round of AEX chromatography. Both amounts (batch A and batch B) were from here on processed separately.

Batch A was combined with the remainder of the SEC pool from Example 5 and the remainder of the CEX pool from Example 2 and the pool subsequently concentrated and diafiltered to a buffer containing 10 mM BIS-TRIS pH 6, 300 mM NaCl. The material was incubated at 30° C. for 65 h. The pH then was lowered to pH 5 by adding a 1 M stock buffer of sodium acetate pH 5 to a concentration of 125 mM, and the sample was again incubated at 30° C. After 24 h, the product was treated with Flavourzyme (Novozymes Corp), a protease mix from Aspergillus oryzae, using a 40:1 weight:weight ratio total protein content of the product versus protease mix, and was for the last time incubated at 30° C. After an overnight incubation, the material was purified via a first SEC step, performed under the same conditions as described in Example 3. Fractions were pooled that were estimated to contain pure product on the basis of cibacron-blue stained polyacrylamide gels under denaturing conditions. After concentration and buffer exchange to 20 mM sodium acetate pH 5, the material was uncapped and demannosylated as set forth in Example 1. D-mannitol was added to a concentration of 100 mM and the material was pooled with uncapped and demannosylated material from Example 5. The final SEC step and subsequent sample manipulations were performed as described in Example 2.

Batch B was pooled with end material from Example 3 and the pool was concentrated and diafiltered to a buffer containing 10 mM BIS-TRIS pH 6, 300 mM NaCl. The product was then treated with the A. oryzae protease mix for an overnight incubation period at 30° C. using the same weight ratios as described in Example 5, and, afterwards, purified via a first SEC step, performed under the same conditions as described in Example 3. Further processing of the product was done as described in Example 5.

In the final batch, product from batch A and batch B were mixed in 14:1 ratio in GAA content.

Example 7 Purification of 76 kDa rhGAA

The 76 kDa form of rhGAA was isolated from strain OXYY1589 as follows. After harvest, the culture was subjected to two rounds of continuous centrifugation. The supernatant was pooled and AMS was introduced to a concentration of approximately 1 M. After dissolution, 1 volume of HIC resin, pre-equilibrated in 20 mM sodium phosphate pH 6, 1 M AMS, was added to 50 volumes of supernatant while stirring to bind the product in a batch uptake mode. The resulting slurry was stored overnight at 4° C. without stirring. During this period, a brown colored layer settled at the top of the solute that was removed in the morning via gentle aspiration. The resin was washed three times with three volumes of lead buffer (20 mM sodium phosphate pH 6, 1 M AMS) in each round before it was packed into a column. The packed resin was washed until UV signal almost reached baseline and the product was afterwards eluted with water. The pH of the eluate was adjusted by adding a stock buffer of 100 mM BIS-TRIS pH 6 to a concentration of 10 mM. The material was then sterile filtered in a bag and stored for a period of eleven days at 4° C.

The second and third chromatography steps and accompanying manipulations of the material were performed as described in Example 4. The pool after the third chromatography step was first concentrated approximately seven times on two TFF Vivaflow 200 modules coupled in parallel (PES membrane, 10 kD MWCO, Sartorius), and then ultra-centrifuged using 15 ml Amicon centrifugal filters of 10 kD MWCO (regenerated cellulose membrane, Merck Millipore). The concentrated material was split in two and purified further via SEC using the same conditions as described for Example 4. Fractions were screened afterwards for purity on cibacron-blue stained polyacrylamide gels under denaturing conditions. Selected fractions were pooled for concentration onto two Vivaflow 200 TFF modules coupled in parallel (PES membrane, 10 kD MWCO, Sartorius). The volume was further reduced using 15 ml Amicon centrifugal filters of 10 kD MWCO (regenerated cellulose membrane, Merck Millipore).

After uncapping and demannosylation, D-mannitol was added to a concentration of 100 mM and the sample was again concentrated on a Vivaflow 50 TFF module (PES membrane, 10 kD MWCO, Sartorius) before loading onto a final SEC column, performed in the same way as described in Example 4. Product containing fractions were pooled and concentrated.

Example 8 Enzymatic Characterization of the Different Variants of huGAA (76, 95 and 110 kD Variants) Using the Artificial Chromogenic Substrate p-Nitrophenyl-α-D-Glucopyranoside

The artificial chromogenic substrate p-nitrophenyl-α-D-glucopyranoside (PNPG) was used to determine the kinetic parameters of the unprocessed huGAA (110 kDa) obtained in Example 2 and the processed huGAA variants obtained in Example 7 (76 kDa), Example 6 (95 kDa) and Example 4 (95 kDa) A comparison also was made with the commercial human α-glucosidase, Myozyme® (alglucosidase alpha, Genzyme).

The enzymes were diluted to 20 μg/ml in 100 mM sodium acetate buffer pH 4.0, containing 0.1% BSA and 100 mM AMS (reaction buffer). Ten μl of the enzyme solutions were added to a 96-well plate in triplicate. The PNPG substrate (Sigma) was diluted to various substrate concentrations (10, 6, 4, 2, 1.6, 1.2, 0.8, and 0.4 mM) in reaction buffer and 90 μl of the diluted substrate was added to each well. The enzymatic reaction was incubated for 60 min at 37° C. followed by the addition of 100 μl 10% sodium carbonate, pH 12 to quench the reaction. The absorbance was measured at 405 nm. A standard curve with p-nitrophenol (PNP) was measured to calculate the amount of product formed per minute. The velocity expressed as μM/min was plotted in function of the different substrate concentrations generating a Michaelis-Menten curve. GraphPad Prism was used to calculate the Vmax and Km according to a direct fit to the Michaelis-Menten equation. The catalytic constant kcat and the catalytic efficiency kcat/Km were calculated. The specific activity of the various enzymes was reported as U/mg where 1 unit is defined as the amount of enzyme that catalyzes the hydrolysis of 1 nmol substrate per minute at 2 mM substrate concentration in 100 mM sodium acetate buffer, pH 4.0+0.1% BSA and 100 mM AMS. The results are shown in Table 1.

TABLE 1 95 kDa 76 kDa 110 kDa 95 kDa Myozyme (Ex. 4) (Ex. 7) (Ex. 2) (Ex. 6) Vmax 12 12 14 13 13 (μM/min) Km (mM) 4.4 4.4 4.3 4.4 4.7 kcat (min⁻¹) 660 677 770 688 730 kcat/Km 150 154 179 156 155 (min⁻¹mM−¹) Sp. Activity 2000 1910 2415 1935 1980 (U/mg)

The unprocessed and processed forms of huGAA and Myozyme have comparable kinetic parameters towards the substrate PNPG. This is in accordance with data reported in literature for human acid α-glucosidase from Mouse milk and Chinese hamster ovary (CHO) medium (Bijvoet et at (1998), Human Molecular Genetics, 7, 1815-1824). The unprocessed (110 kD) and the processed (76 kD) form were reported to have the same Km and kcat value for the artificial substrate 4-methylumbelliferyl-α-D-glucopyranoside.

Example 9 Enzymatic Characterization of the Different Variants of huGAA (76, 95 and 110 kD Variants) Using Rabbit Liver Glycogen as the Substrate

The enzymatic parameters of the unprocessed huGAA (110 kD variant; Example 2) and the processed huGAA variants (76 kDa, Example 7; and 95 kD, Example 6) were tested using rabbit liver glycogen (lot N° 099K37931V, Sigma). A comparison was made with the commercial human α-glucosidase, Myozyme® (alglucosidase alpha, Genzyme). The enzymes were diluted to 500 ng/ml in 100 mM sodium acetate buffer pH 4.0. 50 μl of the enzyme solutions were added to a 96-well plate in duplicate. The glycogen substrate was diluted to various substrate concentrations (250, 200, 150, 100, 75, 50, 25 mg/ml) in acetate buffer and 100 μl of the diluted substrate was added to each well. The enzymatic reaction was incubated for 60 min at 37° C. The amount of glucose was measured using the glucose-oxidase method with the amplex red substrate.

A glucose standard curve was measured to calculate the amount of product formed per minute. The enzyme velocity expressed as μM/min was plotted in function of the different substrate concentrations generating a Michaelis-Menten curve. GraphPad Prism was used to calculate the Vmax, and Km according to a direct fit to the Michaelis-Menten equation. The catalytic constant kcat and the catalytic efficiency kcat/Km were calculated. The specific activity of the various enzymes was reported as U/mg where 1 unit is defined as the amount of enzyme that catalyses the formation of 1 μmol glucose per minute at 50 mg/ml final substrate concentration in 100 mM sodium acetate buffer, pH 4.0. The results are shown in Table 2.

TABLE 2 76 kDa 95 kDa 110 kDa Myozyme (Ex. 7) (Ex. 6) (Ex. 2) Vmax 32 ± 6  15 ± 2 13 ± 1 11 ± 1 (μM/min) Km (mM) 600 ± 140 100 ± 10 93 ± 8 162 ± 17 kcat (min⁻¹) 21100 10000 8600 7260 kcat/Km 35 100 92 45 (min⁻¹mM−¹) Sp. Activity 14 32 27 16 (U/mg)

In this experiment substrate saturation cannot be reached due to the limited solubility of rabbit glycogen (FIG. 4). For Myozyme, only an apparent Km and kcat value were calculated. For the three huGAA variants, lower apparent Km values were determined. The catalytic efficiency of the processed forms is two fold higher than the catalytic efficiency of unprocessed huGAA and Myozyme.

Example 10 Effect of Acid Alpha Glucosidase on Glycogen Clearance in a Mouse Model of Pompe Disease

The GAA products from Example 7 (76 kDa, uncapped and demannosylated), Example 6 (95 kDa, uncapped and demannosylated), and Example 2 (110 kDa, uncapped and demannosylated) were administered to a mouse model of Pompe's disease to determine the glycogen clearance from skeletal muscle and heart.

The experiment was performed with FVB GAA knockout mice and FVB wild type mice. This animal model was chosen as a test system since it is a good representative for the early-onset infantile form of the disease. From birth onwards, the KO mice have a generalized and progressive accumulation of lysosomal glycogen (Bijvoet et al., 1998, supra). Male and female FVB GAA KO mice were obtained from the Erasmus University, Rotterdam. At the start of the treatment, mice were between 26-49 weeks of age.

The test substances or vehicle were administered intravenously by slow bolus in the tail veil with a dose volume of 10 ml/kg body weight (bw) once weekly for four weeks. Mice were fasted 16 hours prior to necropsy. Animals were sacrificed four days after the last injection. Details of the study groups are shown in Table 3.

TABLE 3 Group/color Dose level Dose volume Type of N° of code (mg/kg bw) (ml/kg bw) mice mice 1/White 0 10 WT 9 2/Blue 0 10 KO 16 3/Green 20 mg/kg 76 kDa 10 KO 16 4/Red 20 mg/kg 95 kDa 10 KO 16 5/Yellow 20 mg/kg 110 kDa 10 KO 16 6/Orange 20 mg/kg Myozyme 10 KO 16 Perfusion and Homogenization of Organs

Heart and skeletal muscles (quadriceps femuralis, both sides) were isolated after perfusion with PBS. Tissue was homogenized in 10 weight volumes of ice cold PBS by using an ultra-turrax. Thereafter, the homogenate was sonicated at 16 micron on ice twice for 15 min. After centrifugation for 30 min at 12000 g, supernatant was collected for the measurement of glycogen.

Bioanalysis

The glycogen content in heart and skeletal muscle of each individual mouse was measured using a validated quantitative enzymatic assay. After boiling the tissues, a mixture of amyloglucosidase and α-amylase was added in vitro for the degradation of glycogen towards glucose. The amount of glucose was measured using the glucose-oxidase method with the amplex red substrate. The amount of glycogen is reported as μg glycogen/mg protein.

Statistical Analysis

Glycogen content in heart from groups 2, 3, 4, 5 were analyzed by ANOVA followed by post hoc comparison to group 6 (Myozyme) and to group 2 (placebo) by Dunnet's ttest. Group 1 was left out of the statistical analysis and was used as a quality check for lack of glycogen storage in the WT mouse model.

Because of the presence of outlying observations in the quadriceps data, a Kruskal-Wallis test was used to evaluate potential differential distribution of the glycogen content data of the different products.

Post hoc analysis of the quadriceps data was performed with the Wilcoxon rank sum test. Each product group and the Myozyme group was compared with the placebo (group 2) group, and each product group was compared with Myozyme.

Results

Table 4 shows the average glycogen levels (μg/mg protein) in heart (A) and skeletal muscle (B) of 16 mice per group.

TABLE 4 Summary Mean sd A. Average glycogen levels in heart WT 0.58 0.95 KO/Placebo 525.47 67.75 KO/76 kDa 377.75 80.20 KO/95 kDa 380.56 78.30 KO/110 kDa 416.56 106.77 KO/Myozyme 475.83 98.16 B. Average glycogen levels in skeletal muscle WT 2.22 0.66 KO/Placebo 191.80 34.75 KO/76 kDa 152.27 35.35 KO/95 kDa 169.27 46.68 KO/110 kDa 160.39 36.46 KO/Myozyme 186.49 40.61

FIG. 5 shows the glycogen levels (μg/mg protein) of individual mice in heart (5A) and skeletal muscle (5B). The results show that the GAA products produced herein (110 kDa, 95 kDa, and 76 kDa) statistically reduce glycogen levels in heart compared to placebo-treated mice after four intravenous injections at 20 mg/kg. The same Myozyme® dose did not reduce the amount of glycogen in the heart. The glycogen levels in both the 76 kDa product and the 95 kDa treated groups were statistically different compared to the Myozyme®-treated group. Statistically, there was no difference between the three different GAAproducts produced herein.

The 76 kDa product produced herein also statistically reduced the amount of glycogen in skeletal muscle compared to placebo-treated or Myozyme®-treated mice. The glycogen levels in both the 95 kDa and the 110 kDa product were not statistically different compared to placebo and Myozyme®-treated mice, likely due to a higher variation between the individual mice. Myozyme® at 20 mg/kg was not capable of reducing the glycogen levels in skeletal muscle compared to placebo.

Example 11 Identification of a Protease from Aspergillus oryzae

GAA undergoes specific proteolytic cleavage upon incubation with low quantities of Flavourzyme (Novozymes Corp), a protease mix from Aspergillus oryzae, at acidic pH. The resulting GAA product has a molecular weight of approximately 95 kD on SDS-PAGE under reducing conditions. A similar proteolytic activity was observed in certain partially purified GAA preparations containing background proteins from the production strain (Yarrowia lipolytica).

To evaluate the proteolytic event, the N-glycans of GAA were removed to a single N-acetyl glucosamine per N-glycosylation site using EndoH, prior to proteolytic treatment. This allows more adequate evaluation via SDS PAGE. The GAA product was then incubated with the Flavourzyme protease cocktail or purified samples thereof. The reaction was performed at 30° C. in a 100 mM sodium acetate buffer pH 5. Samples were taken at different time points and analyzed via SDS-PAGE under reducing conditions. Volumes containing 0.5 μg of GAA were loaded per lane.

To investigate which protease family is responsible for the specific proteolysis of GAA in the protease cocktail, protease inhibitors were included in the assays that are specific to defined protease families to facilitate the identification of the protease. The reactions were performed as described above, with the exception that protease inhibitors were now added to the reaction mixture. The irreversible inhibitors PMSF (Sigma-Aldrich prod. nr. E5134-500G) and E-64 (Calbiochem prod. nr. CALB324890-5) were, prior to the proteolysis reaction, incubated with the diluted protease cocktail at a concentration of 1 mM and 10 μM respectively. The reversible inhibitors chymostatin (Calbiochem prod. nr. CALB230790-5), EDTA, and leupeptin (Calbiochem prod. nr. CALB108976-10MG) were directly added to the reaction mixture at a concentration of 60 μg/ml, 50 mM and 100 μM, respectively.

The specific proteolysis of GAA was inhibited by PMSF and chymostatin, protease inhibitors that abolish the activity of serine and cysteine proteases. The irreversible inhibitor E-64, which inhibits cysteine proteases, did not block the proteolysis. These data suggest that the specific proteolysis is a serine protease family member. More evidence supporting this hypothesis was provided by additional assays where the protease cocktail was incubated with PMSF and the redox agent dithiotheitol (DTT), which reduces disulfide bonds. Addition of this reducer reduces the covalent inactive cysteine protease:PMSF adduct, restoring the cysteine protease activity. When inhibited by PMSF, the activity of serine proteases can not be recovered by DTT. This difference in behavior was used to further discriminate between serine and cysteine proteases acting on GAA.

Incubation of the PMSF-inhibited protease with DTT did not restore the GAA-specific proteolysis activity of the protease cocktail. The GAA-specific proteolysis also was not inhibited by the metallo-protease inhibitor EDTA and a broad spectrum inhibitor leupeptin. All data indicate that a serine protease is responsible for this GAA proteolytic event.

In order to identify the protease from the mixture, the protease was purified using a series of chromatography steps. The first chromatography step used an anion exchange chromatography resin (Q-Sepharose FF, GE healthcare). The protease cocktail material was diluted in a 20 mM TRIS-HCl buffer pH 7 prior to loading. The flow through and the elutions at 100 mM, 300 mM and 500 mM NaCl in a 20 mM TRIS-HCl buffer were collected. All flow-through and elution fractions were analyzed using the assay as described above. The protease acting on GAA was present in the flow-through fraction of the run and was significantly enriched compared to the starting material.

The flow-through material was further processed via cation exchange chromatography (SP sepharose XL (GE Heathcare) at pH5 10 mM Na Acetate; elution with 0-300 mM NaCl). Elution fractions were collected and analyzed via instant blue stained SDS PAGE, and assayed for the presence of the protease of interest using the assay as described above.

The majority of the activity was present in the last fractions of the CEX chromatography eluate. The last two fractions were pooled and analyzed via mass spectrometry as follows. The protein mixture was desalted, reduced and alkylated prior to trypsin digestion and subsequently subjected to an LC-MS/MS methodology. Acquired spectra were matched onto the NCBI database using the Mascot algorithm. The following settings were applied:

-   -   Trypsin, Chymotrypsin (up to 4 miscleavages allowed)     -   Oxidation (M,W), deamidation (N,Q) (variable modifications)     -   Carbamidomethylation (fixed modification)     -   Taxonomy: Eukaryotes     -   MS tolerance: 0.05 Da, MS/MS tolerance: 0.05 Da

An alkaline protease from Aspergillus (GenBank Accession No. BAA00258.1; gi 217809) was identified from the search. The sequence of the mature protease is:

>gi|217809|dbj|BAA00258.1| alkaline protease [Aspergillus oryzae] (SEQ ID NO: 8) GLTTQKSAPWGLGSISHKGQQSTDYIYDTSAGEGTYAYVVDSGVNVDHE EFEGRASKAYNAAGGQHVDSIGHGTHVSGTIAGKTYGIAKKASILSVKV FQGESSSTSVILDGFNWAANDIVSKKRTSKAAINMSLGGGYSKAFNDAV ENAFEQGVLSVVAAGNENSDAGQTSPASAPDAITVAAIQKSNNRASFSN FGKVVDVFAPGQDILSAWIGSSSATNTISGTSMATPHIVGLSLYLAALE NLDGPAAVTKRIKELATKDVVKDVKGSPNLLAYNGNA.

SDS-PAGE gel analysis of the purified protease from A. oryzae shows the presence of a band at a MW around 30 kDa (mature protease) and several bands with a MW between 20 and 10 kDa. The low MW bands were excised from the gel, trypsin digested, and analyzed by nano-LC-MS/MS. These bands were identified as products from the A. oryzae alkaline protease, indicating the alkaline protease from A. oryzae is susceptible to autoproteolysis.

Example 12 Expression of the Aspergillus oryzae Protease in Yarrowia lipolytica

The present example describes the construction of Y. lipolytica expressing the mature protein ALP. The gene encoding the alkaline protease (ALP) from Aspergillus oryzae (EC. 3.4.21.63) was codon optimized for Y. lipolytica expression and chemically synthesized as a fusion construct. The fusion construct encoded the entire open reading frame (ORF) of the enzyme including signal peptide (21 amino acids), pro-peptide (100 amino acids) and mature protein (282 amino acids) followed by a linker (SGGG) and a His Tag (10×His residue). See FIG. 9. The complete nucleotide sequence of the fusion construct is shown in FIG. 10.

The synthetic ORF of ALP was cloned into the pPT vector series, as BamHI/AvrII fragments, for targeted integration into the Y. lipolytica genome, utilizing different loci for stable integration of the expression cassette. In the pPT vectors, the bacterial moiety is derived from the plasmid pHSS6, and comprises a bacterial origin of replication (ori) and the kanamycin-resistant gene conferring resistance to kanamycin (KanR). The integration cassette comprises a) a selectable marker for transformation to Y. lipolytica (URA3; LEU2; ADE2), b) the expression cassette composed of a promoter (PDX2; Hp4d) c) a multiple cloning site (MCS) to insert the ALP synthetic construct and d) the terminator of the YlLIP2 gene. The integration cassette is flanked by upstream (P) and downstream (T) sequences of a specific locus for stable single copy targeted integration into Y. lipolytica genome by homologous recombination. Two NotI restriction sites enable the isolation of the expression cassette before transformation to avoid integration of the bacterial moiety.

The media and techniques used for Y. lipolytica is described by Barth and Gaillardin (FEMS Microbiol Rev., 19(4):219-37, 1997), yeast cells were transformed by the lithium acetate method described by Le Dall et al. (Curr Genet., 26(1):38-44, 1994), using 1 μg of purified integration cassette and standard techniques used for E. coli.

The integration of the expression cassette ALP was performed at one free locus and at 2 specific loci based on the fact that the insertion provides elimination of the expression of highly secreted proteins (lipase 2 and lipase 8) unwanted during the fermentation process. The final strain OXYY2184 contains 3 expression cassettes of ALP driven by the semi-constitutive Hp4D promoter.

OXYY2184 produces the recombinant Aspergillus oryzae ALP mature form (35 kDa), yielding about 2 to 2.5 g/L fermentation broth on average. Total protein was assayed using the Bradford technique and the protease activity was measured using an assay with azocasein as substrate. Proteases digest the azocasein towards casein and the free azo dye. Precipitation and centrifugation of the digested proteins allow the free azo dye to be measured at alkaline conditions, which is an indication of the proteolytic activity. The absorbance of this product is measured at OD 440 nm. The amount of digested azocasein can be calculated by correlation with an azocasein dilution series with known concentrations of which the absorbance is measured at OD 440 nm.

ALP in the culture supernatant of strain OXYY2184 was assayed by SDS-PAGE and immunodetected with an anti-His polyclonal antibody. The recombinant ALP produced in Y. lipolytica was active and had similar properties as the purified native enzyme. These enzyme properties of the recombinant ALP permit its use to process the rhuGAA precursor to obtain the 95 kDa rhuGAA form.

Strain OXYY2122 was constructed to co-express the ALP and rhuGAA. One copy of the ALP expression cassette was integrated into a recipient strain expressing the rhuGAA (4 copies of rhuGAA). Both genes encoding huGAA and ALP are driven under the inducible PDX2 promoter. The resulting strain OXYY2122 produces the mature form of ALP together with the rhuGAA precursor (110 Kda). Recombinant huGAA in the culture supernatant of strain OXYY2122 was assayed by SDS-PAGE followed by immunoblotting, and confirmed that the rhGAA was processed to the 95 kDa form in the supernatant. This processing was complete; no 110 kDa form was detected, whereas in the same cultivation of the strain without ALP no processing occurred.

Example 13 Purification of 95 kDa rhGAA Obtained after Treatment of rhGAA Fermentation Broth with the Aspergillus oryzae Alkaline Protease Expressed in Yarrowia lipolytica

The 95 kDa form of rhGAA was isolated from strain OXYY1589 as follows. After harvest, the broth was clarified using ceramic membranes (Pall Corporation). The product was concentrated via hollow fiber membranes with a molecular-weight-cut-off (MWCO) of 10 kD. AMS was added to a concentration of 1 M and the solute was heated to 30° C. prior to filtration. The filtrate was treated with A. oryzae alkaline protease recombinantly expressed in Yarrowia lipolytica (strain OXYY2184) and used after clarification of the fermentation broth without any further purification. A weight:weight ratio of 200:1 for total protein:protease and incubation for 16 h at 30° C. resulted in a full proteolysis to the 95 kDa product.

Analysis after further purification and after uncapping and demannosylation of the phosphorylated N-glycans revealed a 95 kDa GAA product (as observed on SDS-PAGE) with similar specific activity on PNPG as reported in Table 1.

Example 14 Identification of the Proteolytic Cleavage Site in rhGAA after Treatment with Aspergillus Oryzae Alkaline Protease (ALP)

rhGAA was treated with the Aspergillus oryzae ALP and further purified as described in the above examples. To facilitate sequence analysis, the purified sample was treated with PNGaseF to deglycosylate the rhGAA as PNGase F deaminates the N-glycosylated asparagine residues in the sequence to aspartate.

To confirm the sequence of rhGAA, the deglycosylated protein was digested using trypsin following reduction of the disulfide bridges and alkylation of the cysteine residues. The resulting peptide mixture was subjected to LC-MS and MS/MS and the data were matched onto the gene-encoded protein sequence thereby determining identity. Accurate mass (<10 ppm) and fragmentation spectra were criteria used for absolute identification.

Nearly full sequence coverage was obtained from the peptide mixture (residues 23-60, 65-535, and 538-898) and the proteolytic cleavage site was determined to be between amino acids 60 and 65 (sequence numbering according to SEQ ID NO: 1). The gap in the rhGAA sequence between residues 60 and 65 could result from a proteolytic event before Gly62 and/or before Gly65. It is reported in literature that the alkaline protease from Aspergillus oryzae degrades the synthetic peptide Ileu-Gln-Asn-Cys-Pro-Leu-Gly-NH2 (SEQ ID NO:12) between Leu and Gly (see Nakadai et al., 1973, Agr. Biol. Chem., 37, 2685-2694).

The proteolytic cleavage site determined in this experiment is in accordance with the proteolytic processing of GAA observed in the lysosomes. See, Moreland et al., 2005, J. Biol. Chem., 280, 6780-6791, where for the 95 kDa polypeptide, the cleavage site was identified between amino acid 59 and amino acid 68 (sequence numbering according to SEQ ID NO: 1). The cleaved N-terminal peptide remains associated via an interchain disulfide bond.

Other Embodiments

While the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims. 

What is claimed is:
 1. A method for making a molecular complex having acid alpha glucosidase (GAA) activity, said method comprising contacting a GAA protein having at least 90% sequence identity to the amino acid sequence set forth in SEQ ID NO:1 with a protease having at least 85% sequence identity to the amino acid sequence set forth in SEQ ID NO:8, wherein said protease cleaves said protein at one or more sites between amino acid 60 and amino acid 65 of SEQ ID NO:1 to produce a molecular complex having GAA activity and comprising at least two polypeptides.
 2. The method of claim 1, wherein said contacting is performed in vitro.
 3. The method of claim 1, further comprising proteolysis of the protein at one or more sites between amino acid 719 and amino acid 746 of the amino acid sequence set forth in SEQ ID NO:1.
 4. The method of claim 1, further comprising proteolysis of the protein at one or more sites between amino acid 137 and amino acid 151 of the amino acid sequence set forth in SEQ ID NO:1.
 5. The method of claim 1, wherein the contacting occurs in a recombinant fungal cell.
 6. The method of claim 5, wherein the fungal cell is a cell of Yarrowia lipolytica, Arxula adeninivorans, or a methylotrophic yeast.
 7. The method of claim 6, wherein the methylotrophic yeast is a cell of a genus selected from the group consisting of Candida, Hansenula, Oogataea, Pichia, and Torulopsis.
 8. The method of claim 1, further comprising altering the molecular complex with at least one modification that results in enhanced ability of the molecular complex to be transported to the interior of a mammalian cell, wherein: the at least one modification comprises the recognition domain of human insulin-like growth factor II; or at least one of said polypeptides comprises one or more phosphorylated N-glycans and wherein said modification comprises uncapping and demannosylation of at least one phosphorylated N-glycan.
 9. The method of claim 8, wherein at least 40% of the N-glycans on said polypeptide are uncapped and demannosylated.
 10. The method of claim 8, wherein at least 60% of the N-glycans on said polypeptide are uncapped and demannosylated.
 11. The method of claim 8, wherein at least 80% of the N-glycans on said polypeptide are uncapped and demannosylated.
 12. The method of claim 8, wherein at least 90% of the N-glycans on said polypeptide are uncapped and demannosylated. 